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Description 

The present invention relates to a novel family of purified proteins designated BMP-9 proteins and processes for 
obtaining them. These proteins may be used to induce bone and/or cartilage formation and in wound healing and tissue 
5 repair. 

The murine BMP-9 DNA sequence (SEQ ID NO: 1) and amino acid sequence (SEQ ID NO: 2) are set forth in 
Figure 1 . Human BMP-9 sequence is set forth in Figure 3 (SEQ ID NO: 8 and SEQ ID NO: 9). It is contemplated that 
BMP-9 proteins are capable of inducing the formation of cartilage and/or bone. BMP-9 proteins may be further char- 
acterized by the ability to demonstrate cartilage and/or bone formation activity in the rat bone formation assay described 
10 below. 

Murine BMP-9 is characterized by comprising amino acid #319 to #428 of Figure 1 (SEQ ID NO: 2 amino acids 
#1-110). Murine BMP-9 may be produced by culturing a cell transformed with a DNA sequence comprising nucleotide 
#610 to nucleotide #1893 as shown in Figure 1 (SEQ ID NO: 1) and recovering and purifying from the culture medium 
a protein characterized by the amino acid sequence comprising amino acid #319 to #428 as shown in Figure 1 (SEQ 

is id NO: 2) substantially free from other proteinaceous materials with which it is co-produced. 

Human BMP-9 is expected to be homologous to murine BMP-9 and is characterized by comprising amino acid #1 
(Ser, Ala, Gly) to #110 of Figure 3 (SEQ ID NO: 9) (Arg). The invention includes methods for obtaining the DNA se- 
quences encoding human BMP-9. This method entails utilizing the murine BMP-9 nucleotide sequence or portions 
thereof to design probes to screen libraries for the human gene or fragments thereof using standard techniques. Human 

20 BMP-9 may be produced by culturing a cell transformed with the BMP-9 DNA sequence and recovering and purifying 
BMP-9 from the culture medium. The expressed protein is isolated, recovered, and purified from the culture medium. 
The purified expressed protein is substantially free from other proteinaceous materials with which it is co-produced, 
as well as from other contaminants. The recovered purified protein is contemplated to exhibit cartilage and/or bone 
formation activity. The proteins of the invention may be further characterized by the ability to demonstrate cartilage 

25 and/or bone formation activity in the rat bone formation assay described below. 

Human BMP-9 may be produced by culturing a cell transformed with a DNA sequence comprising nucleotide #1 24 
to #453 as shown in SEQ ID NO: 8 and recovering and purifying from the culture medium a protein characterized by 
the amino acid sequence of SEQ ID NO: 9 from amino acid #1 to amino acid #110 substantially free from other pro- 
teinaceous materials with which it is co-produced. 

30 Another aspect of the invention provides pharmaceutical compositions containing a therapeutically effective 

amount of a BMP-9 protein in a pharmaceutical^ acceptable vehicle or carrier. BMP-9 compositions of the invention 
may be used in the formation of cartilage. These compositions may further be utilized for the formation of bone. BMP- 
9 compositions may also be used for wound healing and tissue repair. Compositions of the invention may further include 
at least one other therapeutically useful agent such as the BMP proteins BMP-1 , BMP-2, BMP-3, BMP-4, BMP-5, BMP- 

35 6, and BMP-7 disclosed for instance in PCT publications W088/00205, W089/10409, and W090/11366, and BMP-8, 
disclosed in U.S. application Ser. No. 07/641 ,204 filed January 15, 1991 , Ser. No. 07/525,357 filed May 16, 1990, and 
Ser. No. 07/800,364 filed November 20, 1 991 . 

The compositions of the invention may comprise, in addition to a BMP-9 protein, other therapeutically useful agents 
including growth factors such as epidermal growth factor (EGF), fibroblast growth factor (FGF), transforming growth 

40 factor (TGF-a and TGF-P), and insulin-like growth factor (IGF). The compositions may also include an appropriate 
matrix for instance, for supporting the composition and providing a surface for bone and/or cartilage growth. The matrix 
may provide slow release of the osteoinductive protein and/or the appropriate environment for presentation thereof. 

The BMP-9 compositions may be employed in methods for treating a number of bone and/or cartilage defects, 
periodontal disease and various types of wounds. These methods, according to the invention, entail administering to 

^5 a patient needing such bone and/or cartilage formation wound healing or tissue repair, an effective amount of a BMP- 
9 protein. These methods may also entail the administration of a protein of the invention in conjunction with at least 
one of the novel BMP proteins disclosed in the co-owned applications described above. In addition, these methods 
may also include the administration of a BMP-9 protein with other growth factors including EGF, FGF, TGF-a, TGF-p, 
and IGF. 

so still a further aspect of the invention are DNA sequences coding for expression of a BMP-9, protein. Such se- 

quences include the sequence of nucleotides in a 5' to 3' direction illustrated in Figure 1 (SEQ ID NO: 1 ) and Figure 3 
(SEQ ID NO: 8) or DNA sequences which hybridize under stringent conditions with the DNA sequences of Figure 1 or 
3 and encode a protein having the ability to induce the formation of cartilage and/or bone. Finally, allelic or other 
variations of the sequences of Figure 1 or 3, whether such nucleotide changes result in changes in the peptide sequence 

55 or not, are also included in the present invention. 

A further aspect of the invention includes vectors comprising a DNA sequence as described above in operative 
association with an expression control sequence therefor. These vectors may be employed in a novel process for 
producing a BMP-9 protein of the invention in which a cell line transformed with a DNA sequence encoding a BMP-9 



2 



EP 0 592 562 B1 



protein in operative association with an expression control sequence therefor, is cultured in a suitable culture medium 
and a BMP-9 protein is recovered and purified therefrom. This process may employ a number of known cells both 
prokaryotic and eukaryotic as host cells for expression of the polypeptide. 

Other aspects and advantages of the present invention will be apparent upon consideration of the following detailed 
5 description and preferred embodiments thereof. 

Brief Description of the Drawings 

FIG. 1 comprises DNA sequence and derived amino acid sequence of murine BMP-9 from clone ML14a further 
10 described below. 

FIG. 2 comprises DNA sequence and derived amino acid sequence of human BMP-4 from lambda U20S-3 ATCC 
#40342. 

FIG. 3 comprises DNA sequence and derived amino acid sequence of human BMP-9 from X FIX/H6III ATCC # 
75252. 

75 

Detailed Descripton of the Invention 

The murine BMP-9 nucleotide sequence (SEQ ID NO: 1) and encoded amino acid sequence (SEQ ID NO: 2) are 
depicted in Figure 1. Purified murine BMP-9 proteins of the present invention are produced by culturing a host cell 

20 transformed wth a DNA sequence comprising the DNA coding sequence of Figure 1 (SEQ ID NO: 1 ) from nucleotide 
#610 to nucleotide #1893 and recovering and purifying from the culture medium a protein which contains the amino 
acid sequence or a substantially homologous sequence as represented by amino acid #319 to #428 of Figure 1 (SEQ 
ID NO: 2). The BMP-9 proteins recovered from the culture medium are purified by isolating them from other proteina- 
ceous materials from which they are co-produced and from other contaminants present. 

25 Human BMP-9 nucleotide and amino acid sequence is depicted in SEQ ID No: 8 and 9. Mature human BMP-9 is 

expected to comprise amino acid #1 (Ser, Ala, Gly) to #110 (Arg). 

Human BMP-9 may be produced by culturing a cell transformed with a DNA sequence comprising nucleotide #1 24 
to #453 as shown in SEQ ID NO: 8 and recovering and purifying from the culture medium a protein characterized by 
the amino acid sequence of SEQ ID NO: 9 from amino acid #1 to amino acid #110 substantially free from other pro- 

30 teinaceous materials with which it is co-produced. 

BMP-9 proteins may be characterized by the ability to induce the formation of cartilagerBMP-9 proteins may be 
further characterized by the ability to induce the formation of bone. BMP-9 proteins may be further characterized by 
the ability to demonstrate cartilage and/or bone formation activity in the rat bone formation assay described below. 
The BMP-9 proteins provided herein also include factors encoded by sequences similar to those of Figure 1 and 

35 3 (SEQ ID NO's: 1 and 8), but into which modifications are naturally provided (e.g. allelic variations in the nucleotide 
sequence which may result in amino acid changes in the polypeptide) or deliberately engineered. For example, synthetic 
polypeptides may wholly or partially duplicate continuous sequences of the amino acid residues of Figure 1 of Figure 
3 (SEQ ID NO's: 2 and 9). These sequences, by virtue of sharing primary, secondary, or tertiary structural and confor- 
mational characteristics with bone growth factor polypeptides of Figure 1 and Figure 3 may possess bone growth factor 

40 biological properties in common therewith. Thus, they may be employed as biologically active substitutes for naturally- 
occurring BMP-9 and other BMP-9 polypeptides in therapeutic processes. 

Other specific mutations of the sequences of BMP-9 proteins described herein involve modifications of glycosyla- 
te sites. These modifications may involve O-linked or N-linked glycosylation sites. For instance, the absence of gly- 
cosylate or only partial glycosylation results from amino acid substitution or deletion at asparagine-linked glycosylation 

45 recognition sites. The asparagine-linked glycosylation recognition sites comprise tripeptide sequences which are spe- 
cifically recognized by appropriate cellular glycosylation enzymes. These tripeptide sequences are either asparagine- 
X-threonine or asparagine-X-serine, where X is usually any amino acid. A variety of amino acid substitutions or deletions 
at one or both of the first or third amino acid positions of a glycosylation recognition site (and/or amino acid deletion 
at the second position) results in non-glycosylation at the modified tripeptide sequence. 

so The present invention also encompasses the novel DNA sequences, free of association with DNA sequences 

encoding other proteinaceous materials, and coding on expression for BMP-9 proteins. These DNA sequences include 
those depicted in Figure 1 or Figure 3 (SEQ ID NO's: 1 and 8) in a 5' to 3' direction and those sequences which hybridize 
thereto under stringent hybridization conditions [see, T Maniatis et al, Molecular Cloning (A Laboratory Manual) , Cold 
Spring Harbor Laboratory (1 982), pages 387 to 389] and encode a protein having cartilage and/or bone inducing activity. 

55 Similarly, DNA sequences which code for BMP-9 proteins coded for by the sequences of Figure 1 or Figure 3, but 

which differ in codon sequence due to the degeneracies of the genetic code or allelic variations (naturally-occurring 
base changes in the species population which may or may not result in an amino acid change) also encode the novel 
factors described herein. Variations in the DNA sequences of Figure 1 or Figure 3 (SEQ ID NO: 1 and 8) which are 
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caused by point mutations or by induced modifications (including insertion, deletion, and substitution) to enhance the 
activity, half-life or production of the polypeptides encoded are also encompassed in the invention. 

Another aspect of the present invention provides a novel method for producing BMP-9 proteins. The method of 
the present invention involves culturing a suitable cell line, which has been transformed with a DNA sequence encoding 

s a BMP-9 protein of the invention, under the control of known regulatory sequences. The transformed host cells are 
cultured and the BMP-9 proteins recovered and purified from the culture medium. The purified proteins are substantially 
free from other proteins with which they are co-produced as well as from other contaminants. 

Suitable cells or cell lines may be mammalian cells, such as Chinese hamster ovary cells (CHO). The selection of 
suitable mammalian host cells and methods for transformation, culture, amplification, screening, product production 

10 and purification are known in the art. See, e.g., Gething and Sambrook, Nature, 293: 620-625 (1 981 ), or alternatively, 
Kaufman et al, Mol. Cell. Biol., 5(7): 1750-1 759 (1985) or Howley et al, U.S. Patent 4,419,446. Another suitable mam- 
malian cell line, which is described in the accompanying examples, is the monkey COS-1 cell line. The mammalian 
cell CV-1 may also be suitable. 

Bacterial cells may also be suitable hosts. For example, the various strains of E. coli (e.g., HB101, MCI061) are 

15 well-known as host cells in the field of biotechnology. Various strains of B. subtilis, Pseudomonas, other bacilli and the 
like may also be employed in this method. 

Many strains of yeast cells known to those skilled in the art may also be available as host cells for expression of 
the polypeptides of the present invention. Additionally, where desired, insect cells may be utilized as host cells in the 
method of the present invention. See, e.g. Miller et al, Genetic Engineering, 8:277-298 (Plenum Press 1986) and 

20 references cited therein. 

Another aspect of the present invention provides vectors for use in the method of expression of these novel BMP- 
9 polypeptides. Preferably the vectors contain the full novel DNA sequences described above which encode the novel 
factors of the invention. Additionally the vectors also contain appropriate expression control sequences permitting 
expression of the BMP-9 protein sequences. Alternatively, vectors incorporating modified sequences as described 

25 above are also embodiments of the present invention. The vectors may be employed in the method of transforming 
cell lines and contain selected regulatory sequences in operative association with the DNA coding sequences of the 
invention which are capable of directing the replication and expression thereof in selected host cells. Regulatory se- 
quences for such vectors are known to those skilled in the art and may be selected depending upon the host cells. 
Such selection is routine and does not form part of the present invention. 

30 a protein of the present invention, which induces cartilage and/or bone formation in circumstances where bone is 

not normally formed, has application in the healing of bone fractures and cartilage defects in humans and other animals. 
Such a preparation employing a BMP-9 protein may have prophylactic use in closed as well.as open fracture reduction 
and also in the improved fixation of artificial joints. De novo bone formation induced by an osteogenic agent contributes 
to the repair of congenital, trauma induced, or oncologic resection induced craniofacial defects, and also is useful in 

35 cosmetic plastic surgery. A BMP-9 protein may be used in the treatment of periodontal disease, and in other tooth 
repair processes. Such agents may provide an environment to attract bone-forming cells, stimulate growth of bone- 
forming cells or induce differentiation of progenitors of bone-forming cells. BMP-9 polypeptides of the invention may 
also be useful in the treatment of osteoporosis. A variety of osteogenic, cartilage-inducing and bone inducing factors 
have been described. See, e.g. European patent applications 148,155 and 169,016 for discussions thereof. 

40 The proteins of the invention may also be used in wound healing and related tissue repair. The types of wounds 

include, but are not limited to burns, incisions and ulcers. (See, e.g. PCT Publication W084/01106 for discussion of 
wound healing and related tissue repair). 

It is further contemplated that proteins of the invention may increase neuronal survival and therefore be useful in 
transplantation and treatment of conditions exhibiting a decrease in neuronal survival. 

45 A further aspect of the invention is a therapeutic method and composition for repairing fractures and other conditions 

related to cartilage and/or bone defects or periodontal diseases. The invention further comprises therapeutic methods 
and compositions for wound healing and tissue repair. Such compositions comprise a therapeutically effective amount 
of at least one of the BMP-9 proteins of the invention in admixture with a pharmaceutical^ acceptable vehicle, carrier 
or matrix. 

so it is expected that the proteins of the invention may act in concert with or perhaps synergistically with other related 

proteins and growth factors. Further therapeutic methods and compositions of the invention therefore comprise a ther- 
apeutic amount of at least one BMP-9 protein of the invention with a therapeutic amount of at least one of the other 
BMP proteins disclosed in co-owned applications described above. Such combinations may comprise separate mol- 
ecules of the BMP proteins or heteromolecules comprised of different BMP moieties. For example, a method and 

55 composition of the invention may comprise a disulfide linked dimer comprising a BMP-9 protein subunit and a subunit 
from one of the "BMP" proteins described above. A further embodiment may comprise a heterodimer of BMP-9 moieties. 
Further, BMP-9 proteins may be combined with other agents beneficial to the treatment of the bone and/or cartilage 
defect, wound, or tissue in question. These agents include various growth factors such as epidermal growth factor 
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(EGF), platelet derived growth factor (PDGF), transforming growth factors (TGF-a and TGF-P), and insulin-like growth 
factor (IGF). 

The preparation and formulation of such physiologically acceptable protein compositions, having due regard to 
pH, isotonicity, stability and the like, is within the skill of the art. The therapeutic compositions are also presently valuable 
5 for veterinary applications due to the lack of species specificity in BMP proteins. Particularly domestic animals and 
thoroughbred horses in addition to humans are desired patients for such treatment with BMP-9 of the present invention. 

The therapeutic method includes administering the composition topically, systemically, or locally as an implant or 
device. When administered, the therapeutic composition for use in this invention is, of course, in a pyrogen-free, phys- 
iologically acceptable form. Further, the composition may desirably be encapsulated or injected in a viscous form for 
10 delivery to the site of bone, cartilage or tissue damage. Topical administration may be suitable for wound healing and 
tissue repair. Therapeutically useful agents other than the BMP-9 proteins which may also optionally be included in 
the composition as described above, may alternatively or additionally, be administered simultaneously or sequentially 
with the BMP composition in the methods of the invention. 

Preferably for bone and/or cartilage formation, the composition would include a matrix capable of delivering BMP- 
15 9 or other BMP proteins to the site of bone and/or cartilage damage, providing a structure for the developing bone and 
cartilage and optimally capable of being resorbed into the body. The matrix may provide slow release of BMP-9 and/ 
or the appropriate environment for presentation thereof. Such matrices may be formed of materials presently in use 
for other implanted medical applications. 

The choice of matrix material is based on biocompatibility, biodegradability, mechanical properties, cosmetic ap- 
20 pearance and interface properties. The particular application of the BMP-9 compositions will define the appropriate 
formulation. Potential matrices for the compositions may be biodegradable and chemically defined calcium sulfate, 
tricalciumphosphate, hydroxyapatite, polylactic acid and polyanhydrides. Other potential materials are biodegradable 
and biologically well defined, such as bone or dermal collagen. Further matrices are comprised of pure proteins or 
extracellular matrix components. Other potential matrices are nonbiodegradable and chemically defined, such as sin- 
25 tered hydroxyapatite, bioglass, aluminates, or other ceramics. Matrices may be comprised of combinations of any of 
the above mentioned types of material, such as polylactic acid and hydroxyapatite or collagen and tricalciumphosphate. 
The bioceramics may be altered in composition, such as in calcium-aluminate-phosphate and processing to alter pore 
size, particle size, particle shape, and biodegradability. 

The dosage regimen will be determined by the attending physician considering various factors which modify the 
30 action of the BMP-9 protein, e.g. amount of bone weight desired to be formed, the site of bone damage, the condition 
of the damaged bone, the size of a wound, type of damaged tissue, the patient's age, sex, and diet, the severity of any 
infection, time of administration and other clinical factors. The dosage may vary with the type of matrix used in the 
reconstitution and the types of BMP proteins in the composition. The addition of other known growth factors, such as 
IGF I (insulin like growth factor I), to the final composition, may also effect the dosage. Progress can be monitored by 
35 periodic assessment of bone growth and/or repair, for example, x-rays, histomorphometric determinations and tetra- 
cycline labeling. 

The following examples illustrate practice of the present invention in recovering and characterizing murine BMP- 
9 protein and employing it to recover the human and other BMP-9 proteins, obtaining the human proteins and expressing 
the proteins via recombinant techniques. 

40 

EXAMPLE I 
Murine BMP-9 

45 750,000 recombinants of a mouse liver cDNA library made in the vector lambdaZAP (Stratagene/Catalog #935302) 

are plated and duplicate nitrocellulose replicas made. A fragment of human BMP-4 DNA corresponding to nucleotides 
1330-1627 of Figure 2 (SEQ ID NO: 3) (the human BMP-4 sequence) is 32 P-labeled by the random priming procedure 
of Feinberg et al. [Anal. Biochem. 1 32: 6-13(1 983)] and hybridized to both sets of filters in SHB at 60°C for 2 to 3 days. 
Both sets of filters are washed under reduced stringency conditions (4X SSC, 0.1% SDS at 60°C). Many duplicate 

so hybridizing recombinants of various intensities (approximately 92) are noted. 50 of the strongest hybridizing recom- 
binant bacteriophage are plaque purified and their inserts are transferred to the plasmid Bluescript SK (+/-) according 
to the in vivo excision protocol described by the manufacturer (Stratagene). DNA sequence analysis of several recom- 
binants indicate that they encode a protein homologous to other BMP proteins and other proteins in the TGF-p family. 
The DNA sequence and derived amino acid sequence of one recombinant, designated ML14a, is set forth in Figure 

55 1. (SEQ ID NO: 1) 

The nucleotide sequence of clone ML14a contains an open reading frame of 1284 bp, encoding a BMP-9 protein 
of 428 amino acids. The encoded 428 amino acid BMP-9 protein is contemplated to be the primary translation product 
as the coding sequence is preceded by 609 bp of 5' untranslated sequence with stop codons in all three reading frames. 
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The 428 amino acid sequence predicts a BMP-9 protein with a molecular weight of 48,000 daltons. 

Based on knowledge of other BMP proteins and other proteins within the TGF-p family, it is predicted that the 
precursor polypeptide would be cleaved at the multibasic sequence ARG-ARG-LYS-ARG in agreement with a proposed 
consensus proteolytic processing sequence of ARG-X-X-ARG. Cleavage of the BMP-9 precursor polypeptide at this 

s location would generate a 110 amino acid mature peptide beginning with the amino acid SER at position #319. The 
processing of BMP-9 into the mature form is expected to involve dimerization and removal of the N-terminal region in 
a manner analogous to the processing of the related protein TGF-p [L.E. Gentry, et al., Molec. & Cell. Biol. 8:4162 
(1988); R. Derynck, et al.. Nature 316:701 (1985)]. 

It is contemplated therefore that the mature active species of murine BMP-9 comprises a homodimer of 2 polypep- 

10 tide subunits, each subunit comprising amino acids #319-#428 with a predicted molecular weight of approximately 
12,000 daltons. Further active species are contemplated comprising amino acids #326 - #428 thereby including the 
first conserved cysteine residue. As with other members of the BMP and TGF-p family of proteins, the carboxy-terminal 
region of the BMP-9 protein exhibits greater sequence conservation than the more amino-terminal portion. The percent 
amino acid identity of the murine BMP-9 protein in the cysteine-rich C-terminal domain (amino acids #326 - #428) to 

15 the corresponding region of other human BMP proteins and other proteins within the TGF-p family is as follows: BMP- 
2, 53%; BMP-3, 43%; BMP-4, 53%; BMP-5, 55%; BMP-6, 55%; BMP-7, 53%; Vgl, 50%; GDF-1, 43%; TGF-pi, 32%; 
TGF-p2, 34%; TGF-p3, 34%; inhibin p(B), 34%; and inhibin p(A), 42%. 



20 
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EXAMPLE II 
Human BMP-9 



Murine and human osteoinductive factor genes are presumed to be significantly homologous, therefore the murine 
coding sequence or a portion thereof is used as a probe to screen a human genomic library or as a probe to identify 

25 a human cell line or tissue which synthesizes the analogous human cartilage and/or bone protein. A human genomic 
library (Toole et al., supra ) may be screened with such a probe, and presumptive positives isolated and DNA sequence 
obtained. Evidence that, this recombinant encodes a portion of the human BMP-9 relies of the murine/human protein 
and gene structure homologies. 

Once a recombinant bacteriophage containing DNA encoding portion of the human cartilage and/or bone inductive 

30 factor molecule is obtained, the human coding sequence can be used as a probe to identify a human cell line or tissue 
which synthesizes BMP-9. Alternatively, the murine coding sequence can be used as a probe to identify such human 
cell line or tissue. Briefly described, RN A is extracted from a selected cell or tissue source and either electrophoresed 
on a formaldehyde agarose gel and transferred to nitrocellulose, or reacted with formaldehyde and spotted on nitro- 
cellulose directly. The nitrocellulose is then hybridized to a probe derived from a coding sequence of the murine or 

35 human BMP-9. mRNA is selected by oligo (dT) cellulose chromatography and cDNA is synthesized and cloned in 
lambda gt10 or lambda ZAP by established techniques (Toole et al., supra ). 

Additional methods known to those skilled in the art may be used to isolate the human and other species' BMP-9 
proteins of the invention. 

40 A. Isolation of Human BMP-9 DNA 

One million recombinants of a human genomic library constructed in the vector A.FIX (Stratagene catalog # 944201 ) 
are plated and duplicate nitrocellulose replicas made. Two oligonucleotides probes designed on the basis of nucleotides 
#1665-#1704and #1837-#1876 of the sequence set forth in Figure 1 (SEQ ID NO: 1) are synthesized on an automated 
*5 DNA synthesizer. The sequence of these two oligonucleotides is indicated below: 

# 1 : CTATGAGTGTAAAGGGGGTTGCTTCTTCCCATTGGCTGAT 



# 2 : GTGCCAACCCTCAAGTACCACTATGAGGGGATGAGTGTGG 



These two oligonucleotide probes are radioactively labeled with y^P-ATP and each is hybridized to one set of the 
duplicate nitrocellulose replicas in SHB at 65°C and washed with 1X SSC, 0.1% SDS at 65°C. Three recombinants 
55 which hybridize to both oligonucleotide probes are noted. All three positively hybridizing recombinants are plaque 
purified, bacteriophage plate stocks are prepared and bacteriophage DNA is isolated from each. The oligonucleotide 
hybridizing regions of one of these recombinants, designated HGIII, is localized to a 1.2 kb Pst l/Xba I fragment. This 
fragment is subcloned into a plasmid vector (pGEM-3) and DNA sequence analysis is performed. HGIII was deposited 
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with the ATCC, 12X1 Parklawn Drive, Rockville, Maryland USA on June 16, 1992 under the requirements of the 
Budapest Treaty and designated as ATCC # 75252. This subclone is designated pGEM-111. A portion of the DNA 
sequence of clone pGEM-1 1 1 is set forth in Figure 3 (SEQ ID NO:8/ HUMAN BMP-9 sequence). This sequence encodes 
the entire mature region of human BMP-9 and a portion of the propeptide. It should be noted that this sequence consists 

5 of preliminary data. Particularly, the propeptide region is subject to further analysis and characterization. For example, 
nucleotides #1 through #3 (TGA) encode a translational stop which may be incorrect due to the preliminary nature of 
the sequence. It is predicted that additional sequences present in both pGEM-111 (the 1.2 kb Pstl/Xbal fragment of 
HGIII subcloned into pGEM) and HGIII encode additional amino acids of the human BMP-9 propeptide region. Based 
on knowledge of other BMPs and other proteins within the TGF-p family, it is predicted that the precursor polypeptide 

10 would be cleaved at the multibasic sequence ARG-ARG-LYS-ARG (amino acids # -4 through # -1 of SEQUENCE ID 
NO:9) in agreement with a proposed consensus proteolytic processing sequence ARG-X-X-ARG. Cleavage of the 
human BMP-9 precursor polypeptide at this location would generate a 110 amino acid mature peptide beginning with 
the amino acid SER at position #1 of SEQUENCE ID NO:9 (encoded by nucleotides #1 24 through #1 26 of SEQUENCE 
ID NO:8). The processing of human BMP-9 into the mature form is expected to involve dimerization and removal of 

15 the N-terminal region in a manner analogous to the processing of the related protein TGF-p [L.E. Gentry, et al., Molec. 
& Cell. Biol. 8:4162 (1988); R. Derynck, et al., Nature 316:701 (1985)]. 

It is contemplated therefore that the mature active species of human BMP-9 comprises a homodimer of two 
polypeptide subunits, each subunit comprising amino acids #1 through #110 of SEQUENCE ID NO:9, with a predicted 
molecular weight of 12,000 daltons. Further active species are contemplated comprising amino acids #8 through #110 

20 thereby including the first conserved cysteine residue. As with other members of the BMP and TGF-p family of proteins, 
the carboxy-terminal portion of the human BMP-9 sequence exhibits greater sequence conservation than the amino- 
terminal portion, the percent amino acid identity of the human BMP-9 protein in the cysteine-rich C-terminal domain 
(amino acids #8 through #110) to the corresponding region of other human BMP proteins and other proteins within the 
TGF-p family is as follows: BMP-2, 52%; BMP-3, 40%; BMP-4, 52%; BMP-5, 55%; BMP-6, 55%; BMP-7, 53%; murine 

25 BMP-9, 97%; Vgl, 50%; GDF-1, 44%; TGF-p1, 32%; TGF-P2, 32%; TGF-p3, 32%; inhibin p (B), 35%; and inhibin p 
(A), 41%. 

EXAMPLE III 

30 Rosen Modified Sampath-Reddi Assay 

A modified version of the rat bone formation assay described in Sampath and Reddi, Proc. Natl. Acad. Sci. U.S. 
A., 80:6591-6595 (1983) is used to evaluate bone and/or cartilage activity of the BMP proteins. This modified assay 
is herein called the Rosen-modified Sampath-Reddi assay. The ethanol precipitation step of the Sampath-Reddi pro- 

35 cedure is replaced by dialyzing (if the composition is a solution) or diafiltering (if the composition is a suspension) the 
fraction to be assayed against water. The solution or suspension is then redissolved in 0.1 % TFA, and the resulting 
solution added to 20mg of rat matrix. A mock rat matrix sample not treated with the protein serves as a control. This 
material is frozen and lyophilized and the resulting powder enclosed in #5 gelatin capsules. The capsules are implanted 
subcutaneously in the abdominal thoracic area of 21 - 49 day old male Long Evans rats. The implants are removed 

40 after 7-14 days. Half of each implant is used for alkaline phosphatase analysis [See, A. H. Reddi et al., Proc. Natl 
Acad Sci., 69:1601 (1972)]. 

The other half of each implant is fixed and processed for histological analysis. 1jim glycolmethacrylate sections 
are stained with Von Kossa and acid fuschin to score the amount of induced bone and cartilage formation present in 
each implant. The terms +1 through +5 represent the area of each histological section of an implant occupied by new 

45 bone and/or cartilage cells and matrix. A score of +5 indicates that greater than 50% of the implant is new bone and/ 
or cartilage produced as a direct result of protein in the implant. A score of +4, +3, +2 and +1 would indicate that greater 
than 40%, 30%, 20% and 10% respectively of the implant contains new cartilage and/or bone. In a modified scoring 
method, three non-adjacent sections are evaluated from each implant and averaged. °+/- n indicates tentative identifi- 
cation of cartilage or bone; °+1° indicates >10% of each section being new cartilage or bone; d +2", >25%; °+3 n , >50%; 

so "+4°, -75%; -+5°, >80%. A indicates that the implant is not recovered. 

It is contemplated that the dose response nature of the BMP-9 containing samples of the matrix samples will 
demonstrate that the amount of bone and/or cartilage formed increases with the amount of BMP-9 in the sample. It is 
contemplated that the control samples will not result in any bone and/or cartilage formation. 

As with other cartilage and/or bone inductive proteins such as the above-mentioned "BMP" proteins, the bone and/ 

55 or cartilage formed is expected to be physically confined to the space occupied by the matrix. Samples are also analyzed 
by SDS gel electrophoresis and isoelectric focusing followed by autoradiography The activity is correlated with the 
protein bands and pi. To estimate the purity of the protein in a particular fraction an extinction coefficient of 1 OD/mg- 
cm is used as an estimate for protein and the protein is run on SDS PAGE followed by silver staining or radioiodination 
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and autoradiography. 
EXAMPLE IV 
Expression of BMP-9 

In order to produce murine, human or other mammalian BMP-9 proteins, the DNA encoding it is transferred into 
an appropriate expression vector and introduced into mammalian cells or other preferred eukaryotic or prokaryotic 
hosts by conventional genetic engineering techniques. The preferred expression system for biologically active recom- 
binant human BMP-9 is contemplated to be stably transformed mammalian cells. 

One skilled in the art can construct mammalian expression vectors by employing the sequence of Figure 1 (SEQ 
I D NO: 1 ) or Figure 3 (SEQ ID NO: 8), or other DNA sequences encoding BMP-9 proteins or other modified sequences 
and known vectors, such as pCD [Okayama et al., Mol. Cell Biol., 2:161-170 (1982)], pJL3, pJL4 [Gough et al., EMBO 
J,, 4:645-653 (1985)] and pMT2 CXM. 

The mammalian expression vector pMT2 CXM is a derivative of p91023 (b) (Wong et al., Science 228: 810-815, 
1985) differing from the latter in that it contains the ampicillin resistance gene in place of the tetracycline resistance 
gene and further contains a Xhol site for insertion of cDNA clones. The functional elements of pMT2 CXM have been 
described (Kaufman, R.J., 1985, Proc. Natl. Acad. Sci. USA 82:689-693) and include the adenovirus VA genes, the 
SV40 origin of replication including the 72 bp enhancer, the adenovirus major late promoter including a 5' splice site 
and the majority of the adenovirus tripartite leader sequence present on adenovirus late mRNAs, a 3' splice acceptor 
site, a DHFR insert, the SV40 early polyadenylation site (SV40), and pBR322 sequences needed for propagation in 
E. coli. 

Plasmid pMT2 CXM is obtained by EcoRI digestion of pMT2-VWF, which has been deposited with the American 
Type Culture Collection (ATCC), Rockville, MD (USA) under accession number ATCC 67122. EcoRI digestion excises 
the cDNA insert present in pMT2-VWF, yielding pMT2 in linear form which can be ligated and used to transform E. coli 
HB 101 or DH-5 to ampicillin resistance. Plasmid pMT2 DNA can be prepared by conventional methods. pMT2 CXM 
is then constructed using loopout/in mutagenesis [Morinaga, et al., Biotechnology 84: 636 (1984). This removes bases 
1075 to 1145 relative to the Hind III site near the SV40 origin of replication and enhancer sequences of pMT2. In 
addition it inserts the following sequence: 

5 ' PO-CATGGGCAGCTCGAG-3 ' (SEQ ID NO: 5) 

at nucleotide 1145. This sequence contains the recognition site for the restriction endonuclease Xho I. A derivative of 
pMT2CXM, termed pMT23, contains recognition sites for the restriction endonucleases Pstl, Eco Rl, Sail and Xhol. 
Plasmid pMT2 CXM and pMT23 DNA may be prepared by conventional methods: 

pEMC2bl derived from pMT21 may also be suitable in practice of the invention. pMT21 is derived from pMT2 which 
is derived from pMT2-VWF As described above EcoRI digestion excises the cDNA insert present in pMT-VWF, yielding 
pMT2 in linear form which can be ligated and used to transform E. Coli HB 101 or DH-5 to ampicillin resistance. Plasmid 
pMT2 DNA can be prepared by conventional methods. 

pMT21 is derived from pMT2 through the following two modifications. First, 76 bp of the 5* untranslated region of 
the DHFR cDNA including a stretch of 19 G residues from G/C tailing for cDNA cloning is deleted. In this process, a 
Xhol site is inserted to obtain the following sequence immediately 



upstream from DHFR: 5' - CTGCAG GCGAGCCT GAATTCCTCGAG CCATC ATG -3 9 

Pstl Eco RI Xhol 

(SEQ ID NO: 6) 

Second, a unique Clal site is introduced by digestion with EcoRV and Xbal, treatment with Klenow fragment of DNA 
polymerase I, and ligation to a Clal linker (C ATCGATG). This deletes a 250 bp segment from the adenovirus associated 
RNA (VAI) region but does not interfere with VAI RNA gene expression or function. pMT21 is digested with EcoRI and 
Xhol, and used to derive the vector pEMC2B1 . 

A portion of the EMCV leader is obtained from pMT2-ECAT1 [S.K. Jung, et al, J. Virol 63:1651-1660 (1989)] by 
digestion with Eco Rl and Pstl, resulting in a 2752 bp fragment. This fragment is digested with Taql yielding an Eco 
Rl-Taql fragment of 508 bp which is purified by electrophoresis on low melting agarose gel. A 68 bp adapter and its 
complementary strand are synthesized with a 5' Taql protruding end and a 3' Xhol protruding end which has the following 
sequence: 
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5 ' -C^GGTTAAAAAACGTCTAGGCCCCCCGAACCACGGGGACGTGGTTTTCCTTT 
Taql 

5 GAAAAACACG ATT GC-3 ' 

Xhol (SEQ ID NO: 7) 

This sequence matches the EMC virus leader sequence from nucleotide 763 to 827. It also changes the ATG at position 
10 10 within the EMC virus leader to an ATT and is followed by a Xhol site. A three way ligation of the pMT21 Eco Rl- 
Xhol fragment, the EMC virus EcoRI-Taql fragment, and the 68 bp oligonucleotide adapter Taql-Xhol adapter resulting 
in the vector pEMC2^1 . 

This vector contains the SV40 origin of replication and enhancer, the adenovirus major late promoter, a cDN A copy 
of the majority of the adenovirus tripartite leader sequence, a small hybrid intervening sequence, an SV40 polyade- 

1$ nylation signal and the adenovirus VA I gene, DHFR and (^-lactamase markers and an EMC sequence, in appropriate 
relationships to direct the high level expression of the desired cDNA in mammalian cells. 

The construction of vectors may involve modification of the BMP-9 DNA sequences. For instance, BMP-9 cDNA 
can be modified by removing the non-coding nucleotides on the 5' and 3' ends of the coding region. The deleted non- 
coding nucleotides may or may not be replaced by other sequences known to be beneficial for expression. These 

20 vectors are transformed into appropriate host cells for expression of BMP-9 proteins. 

One skilled in the art can manipulate the sequences of Figure 1 or Figure 3 (SEQ ID NO: 1 and 8) by eliminating 
or replacing the mammalian regulatory sequences flanking the coding sequence with bacterial sequences to create 
bacterial vectors for intracellular or extracellular expression by bacterial cells. For example, the coding sequences 
could be further manipulated (e.g. ligated to other known linkers or modified by deleting non-coding sequences there- 

25 from or altering nucleotides therein by other known techniques). The modified BMP-9 coding sequence could then be 
inserted into a known bacterial vector using procedures such as described in T Taniguchi et al., Proc. Natl Acad. Sci. 
USA, 77:5230-5233 (1980). This exemplary bacterial vector could then be transformed into bacterial host cells and a 
BMP-9 protein expressed thereby. For a strategy for producing extracellular expression of BMP-9 proteins in bacterial 
cells, see, e.g. European patent application EPA 177,343. 

30 Similar manipulations can be performed for the construction of an insect vector [See, e.g. procedures described 

in published European patent application 155,476] for expression in insect cells. A yeast vector could also be con- 
structed employing yeast regulatory sequences for intracellular or extracellular expression of the factors of the present 
invention by yeast cells. [See, e.g., procedures described in published PCT application W086/00639. and European 
patent application EPA 123,289]. 

35 a method for producing high levels of a BMP-9 protein of the invention in mammalian cells may involve the con- 

struction of cells containing multiple copies of the heterologous BMP-9 gene. The heterologous gene is linked to an 
amplifiable marker, e.g. the dihydrofolate reductase (DHFR) gene for which cells containing increased gene copies 
can be selected for propagation in increasing concentrations of methotrexate (MTX) according to the procedures of 
Kaufman and Sharp, J. Mol. Biol., 1 59:601 -629 (1 982). This approach can be employed with a number of different cell 

to types. 

For example, a plasmid containing a DNA sequence for a BMP-9 of the invention in operative association with 
other plasmid sequences enabling expression thereof and the DHFR expression plasmid pAdA26SV(A)3 [Kaufman 
and Sharp, Mol. Cell, biol., 2:1304 (1982)] can be co-introduced into DHFR-deficient CHO cells, DUKX-BII, by various 
methods including calcium phosphate coprecipitation and transfection, electroporation or protoplast fusion. DHFR ex- 

45 pressing transformants are selected for growth in alpha media with dialyzed fetal calf serum, and subsequently selected 
for amplification by growth in increasing concentrations of MTX (e.g. sequential steps in 0.02, 0.2, 1.0 and 5uM MTX) 
as described in Kaufman et al., Mol Cell Biol., 5:1750 (1983). Transformants are cloned, and biologically active BMP- 
9 expression is monitored by the Rosen-modified Sampath - Reddi rat bone formation assay described above in Ex- 
ample III. BMP-9 expression should increase with increasing levels of MTX resistance. BMP-9 polypeptides are char- 

50 acterized using standard techniques known in the art such as pulse labeling with [35S] methionine or cysteine and 
polyacrylamide gel electrophoresis. Similar procedures can be followed to produce other related BMP-9 proteins. 

A. BMP-9 Vector Construction 

55 in order to produce human BMP-9 proteins of the invention DNA sequences encoding the mature region of the 

human BMP-9 protein may be joined to DNA sequences encoding the propeptide region of the murine BMP-9 protein. 
This murine/human hybrid DNA sequence is inserted into an appropriate expression vector and introduced into mam- 
malian cells or other preferred eukaryotic or prokaryotic hosts by conventional genetic engineering techniques. The 
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construction of this murine/human BMP-9 containing expression plasmid is described below. 

A derivative of the human BMP-9 sequence (SEQ ID NO:8) comprising the nucleotide sequence from nucleotide 
#105 to #470 is specifically amplified. The following oligonucleotides are utilized as primers to allow the amplification 
of nucleotides #105 to #470 of the human BMP-9 sequence (SEQ ID NO:8) from clone pGEM-111 described above. 

5 

#3 ATCGGGCCCCTTTTAGCCAGGCGGAAAAGGAG 



10 # 4 AGCGAATTCCCCGCAGGCAGATACTACCTG 

This procedure generates the insertion of the nucleotide sequence ATCGGGCCCCT immediately proceeding nucle- 
otide #105 and the insertion of the nucleotide sequence GAATTCGCT immediately following nucleotide #470. The 
addition of these sequences results in the creation of an Apa I and EcoR I restriction endonuclease site at the respective 
15 ends of the specifically amplified DNA fragment. The resulting 374 bp Apa l/EcoR I fragment is subcloned into the 
plasmid vector pGEM-7Zf(+) (Promega catalog# p2251 ) which has been digested with Apa I and EcoR I. The resulting 
clone is designated phBMP9mex-1. 

The following oligonucleotides are designed on the basis of murine BMP-9 sequences (SEQ ID NO:1) and are 
modified to facilitate the construction of the murine/human expression plasmid referred to above: 

20 

#5 

GATTCCGTCGACCACCATGTCCCGTGGGGCCTGGTCTAGATGGATACACAGCTGTGGGGCC 

25 

# 6 CCACAGCTGTGTATCCATCTAGACCAGGCCCCAGGGGACATGGTGGTCGACG 

30 These oligonucleotides contain complimentary sequences which upon addition to each other facilitate the annealing 
(base pairing) of the two individual sequences, resulting in the formation of a double stranded synthetic DNA linker 
(designated LINK-1) in a manner indicated below: 

35 1 5 10 20 30 40 50 60 

iii i i i i i 

iii i i t i i 

#5GATTCCGTCGACCACCATGTCCCCTGGGGCCTGGTCTAGATGGATACACAGCTGTGGGGCC 

GCAGCTGGTGGTACAGGGGACCCCGGACCAGATCTACCTATGTGTCGACACC # 6 

40 

This DNA linker (LINK-1 ) contains recognition sequences of restriction endonucleases needed to facilitate subsequent 
manipulations required to construct the murine/human expression plasmid, as well as sequences required for maximal 
expression of heterologous sequences in mammalian cell expression systems. More specifically (referring to the se- 
quence numbering of oligonucleotide #5/LI NK-1 ): nucleotides #1 -#1 1 comprise recognition sequences for the restriction 

45 endonucleases BamH I and Sal I, nucleotides #11 -#15 allow for maximal expression of heterologuos sequences in 
mammallian cell expression systems, nucleotides #16-#31 correspond to nucleotides #610-#625 of the murine BMP- 
9 sequence (SEQ ID NO: 1 ), nucleotides #32 -#33 are inserted to facilitate efficient restriction digestion of two adjacent 
restriction endonuclease sites (EcoO109 I and Xba I), nucleotides #34-#60 correspond to nucleotides #1515-#1541 
of the murine BMP-9 sequence (SEQ ID NO: 1 ) except that nucleotide #58 of synthetic oligonucloetide #5 is a G rather 

so than the A which appears at position #1539 of SEQ ID NO:1 (This nucleotide conversion results in the creation of an 
Apa I restriction endonuclease recognition sequence, without altering the amino acid sequence it is intended to encode, 
to facilitate further manipulations of the murine/human hybrid expression plasmid. LINK-1 (the double stranded product 
of the annealing of oligonucleotides #5 and #6) is subcloned into the plasmid vector pGEM-7Zf(+) which has been 
digested with the restriction endonucleases Apa I and BamH I. This results in a plasmid in which the sequences normally 

55 present between the Apa I and BamH I sites of the pGEM-7Zf(+) plasmid polylinker are replaced with the sequences 
of LINK-1 described above. The resulting plasmid clone is designated pBMP-9link. 

pBMP-9link is digested with the restriction endonucleases BamH I and Xba I resulting in the removal nucleotides 
#1-#34of LINK-1 (refer to the numbering of oligo #5). Clone ML14a, which contains an insert comprising the sequence 
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set forth in SEQ ID NO:1 , is also digested with the restriction endonucleases BamH I and Xba I resulting in the removal 
of sequences comprising nucloetides #1 -#1515 of SEQUENCE ID NO:1 (murine BMP-9). This BamH l/Xba I fragment 
of mouse BMP-9 is isolated from the remainder of the ML14a plasmid clone and subcloned into the BamH l/Xba I sites 
generated by the removal of the synthetic linker sequences described above. The resulting clone is designated p302. 

5 The p302 clone is digested with the restriction endonuclease EcoO109 I resulting in the excision of nucloetides 

corresponding to nucleotides #621 -#151 5 of the murine BMP-9 sequence (SEQ ID NO: 1) and nucleotides #35-#59 of 
LINK-1 (refer to numbering of oligonucleotide #5). It should be noted that the Apa I restriction site created in LINK-1 
by the A to G conversion described above is a subset of the recognition sequence of EcoO109 I, therefore digestion 
of p302 with EcoO109 1 cleaves at the Apa I site as well as the naturally occuring murine EcoOl 09 1 (location #61 9-#625 

10 of SEQ ID NO:1 ) resulting in the excision of a 920 bp EcoOl 09 1/EcoOI 09 1 (Apa I) fragment comprising the sequences 
described above. This 920 EcoOl 09 l/EcoO109 I (Apa I) fragment is isolated from the remainder of the p302 plasmid 
clone and subcloned into clone pBMP-9link which has been similarly digested with EcoOl 09 I. It should be noted that 
the nucleotides GG (#32-#33 of oligonucleotide #5) originally designed to facilitate a more complete digestion of the 
two adjacent restriction sites EcoO109 I and Xba I of LINK-1, which is now a part of pBMP-9link (described above), 

is results in the creation of Dcm methylation recognition sequence. The restriction nuclease EcoOl 09 I is sensitive to 
Dcm methylation and therefore cleavage of this sequence (nucleotides #25-#31 of oligonucleotide #5/LINK-1) by the 
restriction endonuclease EcoOl 09 1 is prevented at this site. Therefore the plasmid clone pBMP-9link is cleaved at the 
Apa I site but not at the EcoOl 09 1 site upon digestion with the restriction endonuclease EcoOl 09 I as described above, 
preventing the intended removal of the sequences between the EcoOl 09 I and Xba I site of LINK-1 (#32-#55 defined 

20 by the numbering of oligonucleotide #5). This results in the insertion of the 920 bp EcoOl 09 l/Apa I fragment at the 
EcoOl 09 I (Apa I) site of pBMP-9link. The resulting clone is designated p318. 

Clone p318 is digested with the restriction endonucleases Sal I and Apa I, resulting in the excision of sequences 
comprising nucleotides #6-#56 of LINK-1 (refer to oligo #5 for location), nucleotides #621 -#1515 of murine BMP-9 
(SEQ ID NO:1), and nucleotides #35-#60 of LINK-1 (refer to oligo #5 for location). The resulting 972 bp Sal l/Apa I 

25 fragment described above is isolated from the remainder of the p318 plasmid clone and will be utilized in subsequent 
manipulations. 

The clone phBMP9mex-1 (described above), which contains DNA sequences which encode the entire mature 
region and portions of the propeptide of the human BMP-9 protein, is digested with the restriction endonucleases Apa 
I and EcoR I. This results in the excision of a 374 bp fragment comprising nucleotides #105-#470 of the human BMP- 
30 9 sequence (SEQ ID NO:8) and the additional nucleotides of oligonucleotide primers #3 and #4 which contain the 
recognition sequences for the restriction endonucleases Apa I and EcoR I. This 374 bp Apa l/EcoR I fragment is 
combined with the 972 bp Sal l/Apa I fragment from p138 (isolation described above) and ligated to the mammalian 
cell expression plasmid pED6 (a derivative of pEMC2p1) which has been digested with Sal I and EcoR I. The resulting 
clone is designated p324. 

35 The clone ML14a (murine BMP-9) is digested with EcoOl 09 I and Xba I to generate a fragment comprising nu- 

cleotides #621 -#151 5 of SEQ ID NO:1. 

The following oligonucleotides are synthesized on an automated DNA synthesizer and combined such that their 
complimentary sequences can base pair (anneal) with each other to generate a double stranded synthetic DNA linker 
designated LINK-2: 

40 

#7 TCGACCACCATGTCCCCTGG 



*° #8 GCCCCAGGGGACATGGTGG 

This double stranded synthetic DNA linker (LINK-2) anneals in such a way that it generates single stranded ends which 
are compatible to DNA fragments digested with Sal I (one end) or EcoOl 09 I (the other end) as indicated below: 

so 

#7 TCGACCACCATGTCCCCTGG 

GGTGGTACAGGGGACCCCG #8 * 

55 This LINK-2 synthetic DNA linker is ligated to the 895 bp EcoOl 09 l/Xba I fragment comprising nucleotides #621 -#1 51 5 
of murine BMP-9 (SEQ ID NO:1) described above. This results in a 915 bp Sal l/Xba I fragment. 

The clone p324 is digested with Sal l/Xba I to remove sequences comprising nucleotides #6-#56 of LINK-1 (refer 
to oligo #5 for location) and nucleotides #621 -#151 5 of murine BMP-9 (SEQ ID NO:1). The sequences comprising 
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nucleotides #35-#60 of LINK-1 (refer to oligo #5 for location) and the sequences comprising the 374 bp Apa l/EcoR I 
fragment (human BMP-9 sequences) derived from phBMP9mex-1 remain attached to the pED6 backbone. The 915 
bp Sal l/Xba I fragment comprising LINK-2 sequences and nucleotides #621 -#151 5 of murine BMP-9 (SEQ ID NO:1) 
is ligated into the p324 clone from which the Sal I to Xba I sequences described above have been removed. 

The resulting plasmid is designated BMP9fusion and comprises LINK-2, nucleotides #621 -#1551 of murine BMP- 
9 (SEQ ID NO:1), nucleotides #35-#59 of LINK-1 (refer to the numbering of oligonucleotide #5), and the 374 bp Apa 
l/EcoR I fragment (human BMP-9) derived from clone pBMP9mex-1 (described above) inserted between the Sal I and 
EcoR I sites of the mammalian cell expression vector pED6. 

BMP9 fusion is transfected into CHO cells using standard techniques known to those having ordinary skill in the 
art to create stable cell lines capable of expressing human BMP-9 protein. The cell lines are cultured under suitable 
culture conditions and the BMP-9 protein is isolated and purified from the culture medium. 

EXAMPLE V 

*5 Biological Activity of Expressed BMP-9 

To measure the biological activity of the expressed BMP-9 proteins obtained in Example IV above, the proteins 
are recovered from the cell culture and purified by isolating the BMP-9 proteins from other proteinaceous materials 
with which they are co-produced as well as from other contaminants. The pu rified protein may be assayed in accordance 
20 with the rat bone formation assay described in Example III. 

Purification is carried out using standard techniques known to those skilled in the art. It is contemplated, as with 
other BMP proteins, that purification may include the use of Heparin sepharose. 

Protein analysis is conducted using standard techniques such as SDS-PAGE acrylamide [U.K. Laemmli, Nature 
227:680 (1 970)] stained with silver [R.R. Oakley, et al. Anal. Biochem. 105: 361 (1 980)] and by immunoblot [H. Towbin, 
25 et al. Proc, Natl. Acad. Sci. USA 76:4350 (1 979)] 

The foregoing descriptions detail presently preferred embodiments of the present invention. Numerous modifica- 
tions and variations in practice thereof are expected to occur to those skilled in the art upon consideration of these 
descriptions. 

30 SEQUENCE LISTING 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Wozney, John M. Celeste, Anthony 

35 

(ii) TITLE OF INVENTION: BMP-9 COMPOSITIONS 
(Hi) NUMBER OF SEQUENCES: 9 

40 (iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Genetics Institute, Inc. 

(B) STREET: Legal Affairs - 87 CambridgePark Drive 

(C) CITY: Cambridge 
45 (D) STATE: MA 

(E) COUNTRY: US 

(F) ZIP: 02140 

(v) COMPUTER READABLE FORM: 

50 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

55 

(vi) CURRENT APPLICATION DATA: 
(A) APPLICATION NUMBER: US 
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(B) FILING DATE: 

(C) CLASSIFICATION: 

(viii) ATTORNEY/AGENT INFORMATION: 

5 

(A) NAME: Kapinos, Ellen J. 

(B) REGISTRATION NUMBER: 32,245 

(C) REFERENCE/DOCKET NUMBER: Gl 5186A 

10 (ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (617) 876-1170 

(B) TELEFAX: (617) 876-5851 

is (2) INFORMATION FOR SEQ ID NO:1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 2447 base pairs 
20 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNAto mRNA 

25 

(iii) HYPOTHETICAL: NO 

(iv) ANTI-SENSE: NO 

30 (vi) ORIGINAL SOURCE: 

(A) ORGANISM: Mus musculus 

(B) STRAIN: C57B46xCBA 
(F) TISSUE TYPE: liver 

35 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: Mouse livercDNA 

(B) CLONE: ML14A 

40 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 
4S (ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1564.. 1893 

so (ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 610.. 1896 

ss (Ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..2447 



13 



EP 0 592 562 B1 



(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



5 


VA1 IAA1 AAA 


TATTAAGTAT 


TGGAATTAGT 


GAAATTGGAG 


TTCCTTGTGG 


AAGGAAGTGG 


60 






ITITXAGaTT 


GTGTCGGAAG 


CCTGTAATTA 


CGGCTCCAGC 


TCATAGTGGA 


120 






TTAGATTTAT 


GGATAGTTGG 


GTAGTAGGTG 


TAAATGTATG 


TGGTAAAAGG 


180 


10 






AATAAATATG 


ATTAGGGAAA 


CAATTATTAG 


GGTTCATGTT 


240 




V-0 1 111 JA* 


GTGTGTGGAT 


TAG CATT ATT 


TGTTTGATAA 


TAAGTTTAAC 


TAGTCAGTGT 


300 




IwAAAuAA X 


GGAGACGGi 1 


GTTGATTAGG 


CGTTTTGAGG 


ATGGGAATAG 


GATTGAAGGA 


360 


15 


AATATAATGA 


TGGCTACAAC 


GATTGGGAAT 


CCTATTATTG 


TTGGGGTAAT 


GAATGAGGCA 


420 




AATAGATTTT 


CGTTCATTTT 


AATTCTCAAG 


GGGTTTTTAC 


TTTTATGTTT 


GTTAGTGATA 


480 




TTGGTGAGTA 


GGCCAAGGGT 


TAATAGTGTA 


ATTGAATTAT 


AGTGAAATCA 


TATTACTAGA 


540 


20 


CCTGATGTTA 


GAAGGAGGGC 


TGAAAAGGCT 


CCTTCCCTCC 


CAGGACAAAA 


CCGGAGCAGG 


600 



GCCACCCGG ATG TCC CCT GGG GCC TTC CGG GTG GCC CTG CTC CCG CTG 648 

Met Ser Pro Gly Ala Phe Arg Val Ala Leu Leu Pro Leu 
-318 -315 -310 



25 TTC CTG CTG GTC TGT GTC ACA CAG CAG AAG CCG CTG CAG AAC TGG GAA 696 
Phe Leu Leu Val Cys Val Thr Gin Gin Lys Pro Leu Gin Asn Trp Glu 
-305 -300 -295 -290 

CAA GCA TCC CCT GGG GAA AAT GCC CAC AGC TCC CTG GGA TTG TCT GGA 7 44 

Gin Ala Ser Pro Gly Glu Asn Ala His Ser Ser Leu Gly Leu Ser Gly 
30 -2 8 5 -2 8 0 - 2 7 5 

GCT GGA GAG GAG GGT GTC TTT GAC CTG CAG ATG TTC CTG GAG AAC ATG 79 2 

Ala Gly Glu Glu Gly Val Phe Asp Leu Gin Met Phe Leu Glu Asn Met 

-270 -265 -260 

35 AAG GTG GAT TTC CTA CGC AGC CTT AAC CTC AGC GGC ATT CCC TCC CAG 6 4 0 

Lys Val Asp Phe Leu Arg Ser Leu Asn Leu Ser Gly lie Pro Ser Gin 



(2) INFORMATION FOR SEQ ID NO:9: 

40 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 151 amino acids 

(B) TYPE: amino acid 
45 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

50 



55 
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* Thr Arg Glu 
-41 -40 



Gin Val Arg Ala 
-25 

Gly Ser Thr Leu 



Cys Gin Lys Thr 

10 

Ser Trp He He 
25 

Gly Cy6 Phe Phe 
40 

He Val Gin Thr 



Ala Cys Cys Val 

75 

Asp Asp Met Gly 

90 

Val Ala Glu Cys 
105 



Cys Ser Arg Ser 
-35 

Val Thr Arg Arg 

-20 

Ala Arg Arg Lys 
-5 

Ser Leu Arg Val 

• 15 

Ala Pro Lys Glu 

30 

Pro Leu Ala Asp 
45 

Leu Val His Leu 
60 

Pro Thr Lys Leu 



Val Pro Thr Leu 

95 

Gly Cys Arg 
110 



Cys Pro Arg Thr 

-30 

Thr Arg Met Ala 

-15 

Arg Ser Ala Gly 
1 

Asn Phe Glu Asp 

Tyr Glu Ala Tyr 

35 

Asp Val Thr Pro 

50 

Lys Phe Pro Thr 

65 

Ser Pro He Ser 
80 

Lys Tyr His Tyr 



Ala Pro Gin Arg 



His Val Ala Ala 

-10 

Ala Gly Ser His 
5 

He Gly *rp Asp 
20 

Glu Cys Lys Gly 



Thr Lys His Ala 

55 

Lys Val Gly Lys 
Val Leu Tyr Lys 

85 

Glu Gly Met Ser 

100 



(ix) FEATURE: 

# 

(A) NAME/KEY: exon 

(B) LOCATION: 1..470 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 1..456 

(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 124..453 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(B) LOCATION: 1..470 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
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TGA ACA AGA GAG TGC TCA AGA AGC TGT CCA AGG ACG GCT CCA CAG AGG 

* Thr Arg Glu cys ser Arg Ser Cys Pro Arg Thr Ala Pro Gin Arg 
-41 -40 -35 -30 

CAG GTG AGA GCA GTC ACG AGG AGG ACA CGG ATG GCG CAC GTG GCT GCG 

Gin Val Arg Ala Val Thr Arg Arg Thr Arg Met Ala His Val Ala Ala 
-25 -20 -15 -10 

GGG TCG ACT TTA GCC AGG CGG AAA AGG AGC GCC GGG GCT GGC AGC CAC 
Gly Ser Thr Leu Ala Arg Arg Lys Arg Ser Ala Gly Ala Gly Ser His 

-5 i- 1 .5 

TGT CAA AAG ACC TCC CTG CGG GTA AAC TTC GAG GAC ATC GGC TGG GAC 

cys Gin Lys Thr Ser Leu Arg Val Asn Phe Glu Asp lie Gly Trp Asp 
10 15 20 

AGC TGG ATC ATT GCA CCC AAG GAG TAT GAA GCC TAC GAG TGT AAG GGC 
Ser Trp lie lie Ala Pro Lys Glu Tyr Glu Ala Tyr Glu Cys Lys Gly 
25 30 ' 35 

GGC TGC TTC TTC CCC TTG GCT GAC GAT GTG ACG CCG ACG AAA CAC GCT 

Gly Cys Phe Phe Pro Leu Ala Asp Asp Val Thr Pro Thr Lys His Ala 
40 45 50 55 

* 

ATC GTG CAG ACC CTG GTG CAT CTC AAG TTC CCC ACA AAG GTG GGC AAG 
lie Val Gin Thr Leu Val His Leu Lys Phe Pro Thr Lys Val Gly Lys 

60 65 70 

GCC TGC TGT GTG CCC ACC AAA CTG AGC CCC ATC TCC GTC CTC TAC AAG 

Ala Cys Cys Val Pro Thr Lys Leu ser Pro lie Ser Val Leu Tyr Lys 

75 80 85 

♦ 

GAT GAC ATG GGG GTG CCC ACC CTC AAG TAC CAT TAC GAG GGC ATG AGC 
Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His Tyr Glu Gly Met Ser 
90 95 100 

GTG GCA GAG TGT GGG TGC AGG TAGTATCTGC CTGCGGG 

Val Ala Glu Cys Gly Cys Arg 

105 110 



CATGGGCAGC TCGAG 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 

CTGCAGGCGA GCCTGAATTC CTC GAG C CAT CATG 



(2) INFORMATION FOR SEQ ID NO:7: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 68 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

CGAGGTTAAA AAACGTCTAG GCCCCCCGAA CCACGGGGAC GTGGTTTTCC TTTGAAAAAC 
ACGATTGC 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 470 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(iii) HYPOTHETICAL: NO 

(v) FRAGMENT TYPE: C-terminal 

(vi) ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(H) CELL LINE: W1 38 (genomic DNA) 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: human genomic library 

(B) CLONE: lambda 111-1 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 
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Arg He Asn He Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro 
-115 -no -105 

Gly His lieu He Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn 
-100 -95 -90 -85 

Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp 

-80 -75 -70 

Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His 

-65 -60 -55 

Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg He Ser Arg 
-50 -45 -40 

Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu 
-35 -30 -25 

Val Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg 
-20 -15 -10 -5 

Arg Ala Lys Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys 

1 5 io 

Asn Lys Asn Cys Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val 

15 20 25 

Gly Trp Asn Asp Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr 
30 35 40 

Cys His Gly Asp Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr 

45 50 55 60 

Asn His Ala He Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He 

65 70 75 

Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ses Ala He Ser Met Leu 

80 85 90 

Tyr Leu Asp Glu Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met 

95 100 105 

Val Val Glu Gly Cys Gly Cys Arg 

110 115 



(2) INFORMATION FOR SEQ ID NO:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA to mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 
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10 



TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG ATATACACAC 1666 

cys Gly Cys Arg 
115 

CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 1726 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA 1786 

ATCCCTAAAC ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT 1846 

TGATCATATA TTTTGACAAA ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG 1906 

■ 

AGTCATTATT TTAAAAAAAA AAAAAAAACT CTAGAGTCGA CGGAATTC 1954 



1S (2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 408 amino acids 
20 (B) TYPE: amino acid 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

25 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

Met He Pro Gly Asn Arg Met Leu Met Val Val Leu Leu Cys Gin Val 

-292 -290 -285 -280 

30 Leu Leu Gly Gly Ala Ser His Ala Ser Leu He Pro Glu Thr Gly Lys 

-275 -270 



35 



40 



45 



50 



55 



Lys Lys Val Ala Glu He Gin Gly Hie Ala Gly Gly Arg Arg Ser Gly 

-260 -255 -250 -245 

Gin Ser His Glu Leu Leu Arg Asp Phe Glu Ala. Thr Leu Leu Gin Met 

-240 -235 -230 

Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys Ser Ala Val He Pro 

-225 -220 -215 

Asp Tyr Met Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu 

-210 -205 -200 

Glu Gin He His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser 
-195 -190 -185 

Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn 

-180 -175 -170 -165 

He Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu 

-160 -155 -150 

Ser Ser He Pro Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu 

-145 -140 -135 

Phe Arg Glu Gin Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His 
-130 -125 -120 
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GTG GAC GAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT 
Val Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg lie Asn lie 

-125 -120 -115 

TAT GAG GTT ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC 
Tyr Glu Val Met Lys Pro Pro Ala Glu Val Val Pro Gly His Leu He 
-110 -105 -100 

ACA CGA CTA CTG GAC ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG 

Thr Arg Leu Leu Asp Thr Arg Leu Val His His Asn Val Thr Arg Trp 
-95 -90 -85 

GAA ACT TTT GAT GTG AGC CCT GCG GTC CTT CGC TGG ACC CGG GAG AAG 
Glu Thr Phe Asp Val Ser Pro Ala Val Leu Arg Trp Thr Arg Glu Lys 

-80 -75 -70 -65 

CAG CCA AAC TAT GGG CTA GCC ATT GAG GTG ACT CAC CTC CAT CAG ACT 

Gin Pro Asn Tyr Gly Leu Ala He Glu Val Thr His Leu His Gin Thr 

-60 -55 -50 



CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC CGA TCG TTA CCT CAA 

Arg Thr His Gin Gly Gin His Val Arg He Ser Arg Ser Leu Pro Gin 

-45 -40 -35 

it. 

GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC ACC TTT GGC 
Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val Thr Phe Gly 
-30 -25 -20 



CAT GAT GGC 
His Asp Gly 
-15 



GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG CGT 

Gly Hi6 Ala Leu Thr Arg Arg Arg Arg Ala Lys Arg 
-10 -5 



AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC 
Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys 
15 10 15 

m 

CGG CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC 
Arg Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp 

20 25 30 

TGG ATT GTG GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC 
Trp He Val Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp 

35 40 45 

TGC CCC TTT CCA CTG GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT 
Cys Pro Phe Pro Leu Ala Asp His Leu Asn Ser Thr Asn His Ala He 
50 55 60 

GTG CAG ACC CTG GTC AAT TCT GTC AAT TCC AGT ATC CCC AAA GCC TGT 
Val Gin Thr Leu Val Asn Ser Val Asn Ser Ser He Pro Lys Ala Cys 

65 70 75 80 

TGT GTG CCC ACT GAA CTG AGT GCC ATC TCC ATG CTG TAC CTG GAT GAG 
Cys Val Pro Thr Glu Leu Ser Ala He Ser Met Lieu Tyr Leu Asp Glu 

85 90 95 



TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG ATG GTA GTA GAG GGA 
Tyr Asp Lys Val Val Leu Lys Asn Tyr Gin Glu Met Val Val Glu Gly 

100 105 110 



(B) LOCATION: 9.. 1934 
(xi) SAQUENCE DESCRIPTION: SEQ ID NO:3: 
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10 



40 



45 



CTCTA6AG6G 


CAGAGGAGGA 


GGGAGGGAGG 


GAAGGAGCGC 


GGAGCCCGGC 


CCGGAAGCTA 


60 


GGTGAGTGTG 


GCATCCGAGC 


TGAGGGACGC 


GAGCCTGAGA 


CGCCGCTGCT 


GCTCCGGCTG 


120 


AGTATCTAGC 


TTGTCTCCCC 


GATGGGATTC 


CCGTCCAAGC 


TATCTCGAGC 


CTGCAGCGCC 


180 


ACAGTCCCCG 


GCCCTCGCCC 


AGGTTCACTG 


CAACCGTTCA 


GAGGTCCCCA 


GGAGCTGCTG 


240 


CTGGCGAGCC 


CGCTACTGCA 


GGGACCTATG 


GAGCCATTCC 


GTAGTGCCAT 


CCCGAGCAAC 


300 


GCACTGCTGC 


AGCTTCCCTG 


agcctttcca 


GCAAGTTTGT 


TCAAGATTGG 


CTGTCAAGAA 


360 

«J V V 


TCATGGACTG 


TTATTATATG 


CCTTGTTTTC 

■ 


TGTCAAGACA 


CC ATG ATT 
Met lie 
-292 


CCT GGT 

Pro Gly 

-290 


414 



15 AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC 462 

ABn Arg Met Leu Met Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly 

-285 -280 -275 

GCG AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC 510 

Ala Ser His Ala Ser Leu He Pro Glu Thr Gly Lys Lys Lys Val Ala 
20 -270 -265 -260 

GAG ATT CAG GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG 558 

Glu He Gln Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu 

-250 -245 



25 CTC CTG CGG GAC TTC GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC 606 

Leu Leu Arg Asp Phe Glu Ala Thr Leu Leu Gin Met Phe Gly Leu Arg 

-240 -235 -230 -225 

■ 

CGC CGC CCG CAG CCT AGC AAG AGT GCC GTC ATT CCG GAC TAC ATG CGG 654 

Arg Arg Pro Gin Pro Ser Lys Ser Ala Val He Pro Asp Tyr Met Arg 

30 -220 -215 -210 

GAT~CTT TAC CGG CTT CAG TCT GGG GAG GAG GAG GAA GAG CAG ATC CAC 702 

Asp Leu Tyr Arg Leu Gin Ser Gly Glu Glu Glu Glu Glu Gin He His 

-205 -200 -195 

35 AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC AGC CGG GCC AAC ACC 750 

Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala Ser Arg Ala Asn Thr 
-190 -185 -180 

GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC CCA GGG ACC 798 
Val Arg Ser Phe His His Glu Glu His Leu Glu Asn He Pro Gly Thr 
-175 -170 -165 



AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC CCT 846 
Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He Pro 
-160 -155 -150 -145 

GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG 89 4 

Glu Asn Glu Val He Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin 

-140 -135 " * -130 



50 



5$ 



21 



EP 0 592 562 B1 



-10 -5 1 

Gly Ala Ser Ser His cys Gin Lye Thr Ser Leu Arg Val Asn Phe Glu 

5 10 15 

Asp He Gly Trp Asp Ser Trp He He Ala Pro Lys Glu Tyr Asp Ala 

20 25 30 

Tyr Glu eye Lys Gly Gly Cys Phe Phe Pro Leu Ala Asp Asp Val Thr 

35 40 45 50 

Pro Thr Lys His Ala He Val Gin Thr Leu Val His Leu Glu Phe Pro 

55 60 65 

Thr Lys Val Gly Lys Ala Cys Cys Val Pro Thr Lys Leu Ser Pro He 

70 75 80 

Ser He Leu Tyr Lys Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His 

85 90 95 

Tyr Glu Gly Met Ser Val Ala Glu Cys Gly Cys Arg 
100 105 110 



(2) INFORMATION FOR SEQ ID NO:3: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1954 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: double 

(D) TOPOLOGY: linear 

MOLECULE TYPE: cDNA to mRNA 
■(Hi) HYPOTHETICAL: NO 
(iv) ANTI-SENSE: NO 

(vi) .ORIGINAL SOURCE: 

(A) ORGANISM: Homo sapiens 

(G) CELL TYPE: Osteosarcoma Cell Line 

(H) CELL LINE: U-20S 

(vii) IMMEDIATE SOURCE: 

(A) LIBRARY: U20S cDNA in Lambda gtIO 

(B) CLONE: Lambda U20S-3 

(viii) POSITION IN GENOME: 

(C) UNITS: bp 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 403.. 1629 
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(ix) FEATURE: 

(A) NAME/KEY: mat_peptide 

(B) LOCATION: 1279.. 1626 

(ix) FEATURE: 

(A) NAME/KEY: mRNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 
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10 



15 



20 



25 



30 



35 



40 



45 



SO 



Metier Pro Gly Ala Phe Arg Val Ala Leu Leu Pro Leu Phe Leu Leu 



-315 



-310 



-305 



val cys Val^Thr Gin Gin Lys Pro Leu Gin Asn Trp Glu Gin Ala Ser 



-295 



-290 



Pro Gly^Glu Asn Ala His Server Leu Gly Leu Ser Gly Ala Gly Glu 



-280 



-275 



°Sn GlY Val Phe A8 P **\x Gin Met Phe Leu Glu Asn Met Lye Val Asp 

-265 -260 * . 



Phe Leu Arg Ser Leu Asn Leu Ser Gly He Pro Ser Gin Asp Lys Thr 

—250 - * e 



-245 



-240 



Arg Ala Glu Pro Pro Gin Tyr Met He Asp Leu Tyr Asn Arg Tyr Thr 

-"230 . - 



Thr Asp Lys Ser Ser Thr Pro Ala Ser Asn He Val Arg Ser Phe Ser 

- 22 0 -215 -210 

Val Glu Asp Ala lie Ser Thr Ala Ala Thr Glu Asp Phe Pro Phe Gin 
~ 205 -200 -195 

-190 HiS 116 116 Ph ?-- 8n Ile Ser Ile Pro Hl8 Glu 61 « He 



-185 



-180 



-175 



Thr Arg Ala Glu Leu Arg Leu Tyr Val Ser Cys Gin Asn Asp Val Asp 

-170 ■"- F 



-165 



-160 



Thr His Gly Leu Glu Gly ser Met Val Val Tyr Asp Val Leu Glu 

"155 -150 - 14S 

Asp ser Glu Thr Trp Asp Gin Ala Thr Gly Thr Lys Thr Phe Leu Val 
" 140 -135 _ I30 

Ser Asp Ile Asp Glu G1 y T ^P Glu Thr Leu Glu Val ser Ser 
" 125 -120 -us 

Ala Val Lys Arg Trp Val Arg Ala Asp Ser Thr Thr Asn Lys Asn Lys 



-105 



-100 



-95 



Leu Glu Val Thr Val Gin Ser His Arg Glu Ser Cys Asp Thr Leu Asp 

-90 -85 -eo 

lie Ser Val Pro Pro Gly Ser Lys Asn Leu Pro Phe Phe Val Val Phe 

~ 75 -70 -65 

Asn Asp Arg Ser Asn Gly Thr Lys Glu Thr Arg Leu Glu Leu Lys 

-60 -" 



-55 



-50 



Glu Met Ile Gly His Glu Gin Glu Thr Met Leu Val Lys Thr Ala Lvs 
-45 -40 -35 

Asn Ala Tyr Gin Val Ala Gly Glu Ser Gin Glu Glu Glu Gly Leu Asp 



-20 -is 

Gly Tyr Thr Ala Val Gly Pro Leu Leu Ala Arg Arg Lys Arg Ser Thr 
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10 



-15 -10 -5 

AGG AGC ACC GGA GCC AGC AGC CAC TGC GAG AAG ACT TCT CTC AGG GTG 160B 

Arg Ser Thr Gly Ala Ser Ser His Cys Gin Lys Thr Ser Leu Arg Val 
15 10 15 

AAC TTT GAG GAC ATC GGC TGG GAC AGC TGG ATC ATT GCA CCC AAG GAA 1656 

Asn Phe Glu Asp lie Gly Trp Asp Ser Trp He He Ala Pro Lye Glu 

20 25 30 

TAT GAC GCC TAT GAG TGT AAA GGG GGT TGC TTC TTC CCA TTG OCT GAT 1704 

Tyr Asp Ala Tyr Glu Cys Lys Gly Gly Cys Phe Phe Pro Leu Ala Asp 

35 40 45 

GAC GTG ACA CCC ACC AAA CAT GCC ATC GTG CAG ACC CTC GTG CAT CTC 1752 
75 Asp Val Thr Pro Thr Lys His Ala He Val Gin Thr Leu Val His Leu 

50 55 60 

GAG TTC CCC ACA AAG GTG GGC AAA GCC TGC TGC GTT CCC ACC AAA CTG 1800 

Glu Phe Pro Thr Ly6 Val Gly Lys Ala Cys Cys Val Pro Thr Lys Leu 

65 70 75 

20 

AGT CCC ATC TCC ATC CTC TAC AAG GAT GAC ATG GGG GTG CCA ACC CTC 1848 
Ser Pro He Ser He Leu Tyr Lys Asp Asp Met Gly Val Pro Thr Leu 
80 85 90 95 

AAG TAC CAC TAT GAG GGG ATG AGT GTG GCT GAG TGT GGG TGT AGG TAGTCCCTGC 19 C 

25 Lys Tyr His Tyr Glu Gly Met Ser Val Ala Glu Cys Gly Cys Arg 

100 105 110 



30 



35 



40 



AGCCACCCAG 


GGTGGGGATA 


CAGGACATGG 


AAGAGGTTCT 


GGTACGGTCC 


TGCATCCTCC 


1963 


TGCGCATGGT 


ATGCCTAAGT 


TGATCAGAAA 


CCATCCTTGA 


GAAGAAAAGG 


AGTTAGTTGC 


2023 


CCTTCTTGTG 


TCTGGTGGGT 


CCCTCTGCTG 


AAGTGACAAT 


GACTGGGGTA 


TGCGGGCCTG 


2083 


TGGGCAGAGC 


AGG AG ACC CT 


GGAAGGGTTA 


GTGGGTAGAA 


AGATGTCAAA 


AAGGAAGCTG 


2143 


TGG GT AG ATG 


ACCTGCACTC 


CAGTGATTAG 


AAGTCCAGCC 


TTACCTGTGA 


GAGAGCTCCT 


2203 


GGCATCTAAG 


AGAACTCTGC 


TTCCTCATCA 


TCCCCACCGA 


CTTGTTCTTC 


CTTGGGAGTG 


2263 


TGT C CTC AGG 


GAGAACAGCA 


TTGCTGTTCC 


TGTGCCTCAA 


GCTCCCAGCT 


GACTCTCCTG 


2323 


TGG CTC AT AG 


GACTGAATGG 


GGTGAGGAAG 


AGCCTGATGC 


CCTCTGGCAA 


TCAGAGCCCG 


2383 


AAGGACTTCA 


AAACATCTGG 


ACAACTCTCA 


TTGACTGATG 


CTCCAACATA 


ATTTTTAAAA 


2443 


AGAG 












2447 



45 

(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

so (A) LENGTH: 428 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 



(ii) MOLECULE TYPE: protein 
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10 



-255 -250 -245 

GAC AAA ACC AGA GCG GAG CCA CCC CAG TAC ATG ATC GAC TTG TAC AAC 888 

Asp Lys Thr Arg Ala Glu Pro Pro Gin Tyr Met He Asp Leu Tyr Asn 

•240 -235 -230 

AGA TAC ACA ACG GAC AAA TCG TCT ACG CCT GCC TCC AAC ATC GTG CGG 936 
Arg Tyr Thr Thr Asp Lys Ser Ser Thr Pro Ala Ser Asn He Val Arg 
-225 -220 -215 -210 

AGC TTC AGC GTG GAA GAT GCT ATA TCG ACA GCT GCC ACG GAG GAC TTC 984 

Ser Phe Ser Val Glu Asp Ala He Ser Thr Ala Ala Thr Glu Asp Phe 

-205 -200 -195 

CCC TTT CAG AAG CAC ATC CTG ATC TTC AAC ATC TCC ATC CCG AGG CAC 1032 
■ Pro Phe Gin Lys His He Leu He Phe Asn He Ser He Pro Arg His 
15 -190 -185 -180 

GAG CAG ATC ACC AGG GCT GAG CTC CGA CTC TAT GTC TCC TGC CAA AAT 1080 

Glu Gin He Thr Arg Ala Glu Leu Arg Leu Tyr Val Ser Cys Gin Asn 
-175 -170 -165 

20 GAT GTG GAC TCC ACT CAT GGG CTG GAA GGA AGC ATG GTC GTT TAT GAT 1128 

Asp Val Asp Ser Thr His Gly Leu Glu Gly Ser Met Val Val Tyr Asp 
-160 -155 -150 

GTT CTG GAG GAC AGT GAG ACT TGG GAC CAG GCC ACG GGG ACC AAG ACC 1176 

Val Leu Glu Asp Ser Glu Thr Trp Asp Gin Ala Thr Gly Thr Lys Thr 
25 -145 -140 -135 -130 

TTC TTG GTA TCC CAG GAC ATT CGG GAC GAA GGA TGG GAG ACT TTA GAA 1224 

Phe Leu Val Ser Gin Asp He Arg Asp Glu Gly Trp Glu Thr Leu Glu 

-125 -120 -115 

30 GTA TCG AGT GCC GTG AAG CGG TGG GTC AGG GCA GAC TCC ACA ACA AAC 1272 

Val Ser Ser Ala Val Lys Arg Trp Val Arg Ala Asp Ser Thr Thr Asn 

-110 -105 -100 

AAA AAT AAG CTC GAG GTG ACA GTG CAG AGC CAC AGG GAG AGC TGT GAC 132 0 

Lys Asn Lys Leu Glu Val Thr Val Gin Ser His Arg Glu Ser Cys Asp 

35 -95 -g 0 -85 

ACA CTG GAC ATC AGT GTC CCT CCA GGT TCC AAA AAC CTG CCC TTC TTT 1368 
Thr Leu Asp He Ser Val Pro Pro Gly Ser Lys Asn Leu Pro Phe Phe 
-80 -75 -70 

40 GTT GTC TTC TCC AAT GAC CGC AGC AAT GGG ACC AAG GAG ACC AGA CTG 1416 

Val Val Phe Ser Asn Asp Arg Ser Asn Gly Thr Lys Glu Thr Arg Leu 
-65 -60 -55 -50 

GAG CTG AAG GAG ATG ATC GGC CAT GAG CAG GAG ACC ATG CTT GTG AAG 14 64 

Glu Leu Lys Glu Met He Gly His Glu Gin Glu Thr Met Leu Val Lys 
45 -45 -40 -35 

ACA GCC AAA AAT GCT TAC CAG GTG GCA GGT GAG AGC CAA GAG GAG GAG 1512 
Thr Ala Lys Asn Ala Tyr Gin Val Ala Gly Glu Ser Gin Glu Glu Glu 

-30 -25 -20 

50 GGT CTA GAT GGA TAC ACA GCT GTG GGA CCA CTT TTA GCT AGA AGG AAG 1560 

Gly Leu Asp Gly Tyr Thr Ala Val Gly Pro Leu Leu Ala Arg Arg Lys 



55 



Claims 

1. A DNA sequence encoding a protein having the biological activity of a BMP-9 protein of inducing the formation of 
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cartilage and/or bone which sequence is 

(a) the DNA sequence from nucleotides 124 to 453 of SEQ ID No. 8; or 

(b) the DNA sequence from nucleotides 145 to 453 of SEQ ID No. 8; or 

5 (c) a DNA sequence which differs from the DNA sequence of (a) or (b) due to the degeneracies of the genetic 

code; 

(d) an allelic variant of the sequence of (a) or (b); or 

(e) a DNA sequence hybridizing under stringent conditions to the sequences of (a) or (b). 
10 2. A recombinant DNA molecule containing a DNA sequence according to claim 1 . 

m. » m — ■ 

3. The recombinant DNA molecule according to claim 2 wherein said DNA sequence is under the control of regulatory 
elements allowing its expression in a desired host cell. 

15 4. A host cell containing the recombinant DNA molecule according to claim 2 or 3. 

5. The host cell according to claim 4 which is a bacterial cell, a yeast cell or a mammalian cell. 

6. A method for the production of a protein having the biological activity of a BMP-9 protein comprising the cultivation 
20 of a host cell according to claim 4 or 5 under conditions appropriate for expression of said DNA sequence and 

recovering said protein from the culture. 

7. A protein encoded by the DNA sequence of claim 1 . 
25 8. A protein produced by the method of claim 6. 

9. A protein having the biological activity of a BMP-9 protein comprising one of the following amino acid sequences 

(a) the amino acid sequence from amino acids No. 8 to 110 as set forth in Fig. 3 (SEQ ID No. 9); or 
30 (b) the amino acid sequence from amino acids No. 1 to 110 as set forth in Fig. 3 (SEQ ID No. 9). 
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10. A protein having the biological activity of a BMP-9 protein wherein said protein is a dimer wherein each subunit 
comprises at least the amino acid sequence from amino acids No. 8 to 11 0 of Fig. 3 (SEQ ID No. 9) or at least the 
amino acid sequence from amino acids No. 1 to 110 of Fig. 3 (SEQ ID No. 9). 

11. A purified BMP-9 protein obtainable by the steps of 



(a) culturing a cell transformed with a cDNA comprising the nucleotide sequence from nucleotides No. 124 to 
453 as.shown in Fig. 3 (SEQ ID No...8);_and 

4 o (b) recovering and purifying from said culture medium a protein comprising the amino acid sequence from 

amino acids No. 1 to 110 as shown in Fig. 3 (SEQ ID No. 9). 

12. A purified BMP-9 protein obtainable by the steps of 

45 (a) culturing a cell transformed with a cDNA comprising the nucleotide sequence from nucleotides No. 1 24 to 

453 as shown in Fig. 3 (SEQ ID No. 8); and 

(b) recovering from said culture medium a protein comprising an amino acid sequence from amino acids No. 
8 to 110 as shown in Fig. 3 (SEQ ID No. 9). 

so 1 3. A pharmaceutical composition comprising an effective amount of a protein according to any one of claims 7 to 1 2, 
optionally in conjunction with a pharmaceutical^ acceptable vehicle. 

14. The composition of claim 13, further comprising a matrix for supporting said composition and providing a surface 
for bone and/or cartilage growth. 



15. The composition of claim 14 wherein said matrix comprises a material which is hydroxyapatite, collagen, polylactic 
acid or tricalcium phosphate. 
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16. The pharmaceutical composition of any one of claims 13 to 15 for wound healing, tissue repair, inducing bone 
growth or inducing cartilage growth. 

17. Use of a protein according to any one of claims 7 to 12 for preparing a pharmaceutical composition for inducing 
bone formation, cartilage formation, treatment of wounds or tissue repair. 

18. A method for the preparation of a DNA sequence encoding a protein having the biological activity of a BMP-9 
protein of inducing the formation of cartilage and/or bone which sequence is 

(a) the DNA sequence from nucleotides 124 to 453 of SEQ ID No. 8; or 

(b) the DNA sequence from nucleotides 145 to 453 of SEQ ID No. 8; or 

(c) a DNA sequence which differs from the DNA sequence of (a) or (b) due to degeneracies of the genetic code; 

(d) an allelic variant of the sequence of (a) or (b); or 

(e) a DNA sequence hybridizing under stringent conditions to the sequences of (a) or (b), 
said method comprising the following steps: 

(i) plating a human genomic library and preparing duplicate nitrocellulose replicas; 

(ii) hybridizing one set of the duplicate nitrocellulose replicas with the labeled oligonucleotide 

#1 : CTATGAGTGTAAAGGGGGTTGCTTCTTCCCATTGGCTGAT 
and the other set with the labeled oligonucleotide 



#2: GTGCCAACCCTCAAGTACCACTATGAGGGGATGAGTGTGG; 

and 

(-jjj)~j S olating-those-clones which hybridize-to both oligonucleotides and- determining the sequence of their 

inserts. 

19. A process for the manufacture of a composition according to claim 13, characterized in the use of the protein of 
any one of claims 7 to 12 as an essential constituent of said composition. 



Patentanspruche 

1. DNA-Sequenz, die ein Protein mit der biologischen Aktivitat der Induktion der Bildung von Knorpel und/oder Kno- 
chen eines BMP-9-Proteins codiert, wobei die Sequenz ist 

(a) die DNA-Sequenz von Nucleotid 124 bis 453 von SEQ ID No. 8; oder 

(b) die DNA-Sequenz von Nucleotid 145 bis 453 von SEQ ID No. 8; oder 

(c) eine DNA-Sequenz, die sich von der DNA-Sequenz nach (a) Oder (b) aufgrund der Degeneration des 
genetischen Codes unterscheidet; oder 

(d) eine allelische Variante der Sequenz nach (a) oder (b); oder 

(e) eine DNA-Sequenz, die unter stringenten Bedingungen mit den Sequenzen nach (a) oder (b) hybridisiert. 

2. Rekombinantes DNA-Molekul, das eine DNA-Sequenz nach Anspruch 1 enthalt. 

3. Rekombinantes DNA-Molekul nach Anspruch 2, wobei die DNA-Sequenz unter der Kontrolle von regulatorischen 
Elementen steht, die ihre Expression in einer gewunschten Wirtszelle erlauben. 

4. Wirtszelle, die das rekombinante DNA-Molekul nach Anspruch 2 oder 3 enthalt. 

5. Wirtszelle nach Anspruch 4, die eine Bakterienzelle, eine Hefezelle oder eine Saugerzelle ist. 
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6. Verfahren zur Herstellung eines Proteins mit der biologischen Aktivitat eines BMP-9-Proteins, umfassend die Zuch- 
tung einer Wirtszelle nach Anspruch 4 Oder 5 unter Bedingungen, die fur die Expression der DNA-Sequenz ge- 
eignet sind, und die Gewinnung des Proteins aus der Kultur. 

7. Protein, das von der DNA-Sequenz nach Anspruch 1 codiert wird. 

8. Protein, das durch das Verfahren nach Anspruch 6 hergestellt wird. 

9. Protein mit der biologischen Aktivitat eines BMP-9-Proteins, das eine der folgenden Aminosauresequenzen umfaBt 

(a) die Aminosauresequenz von Aminosaure Nr. 8 bis 110, die in Fig. 3 (SEQ ID No. 9) dargestellt ist; Oder 

(b) die Aminosauresequenz von Aminosaure Nr. 1 bis 110, die in Fig. 3 (SEQ ID No. 9) dargestellt ist. 

10. Protein mit der biologischen Aktivitat eines BMP-9-Proteins, wobei das Protein ein Dimer ist, in dem jede Unter- 
15 einheit mindestens die Aminosauresequenz von Aminosaure Nr. 8 bis 110 von Fig. 3 (SEQ ID No. 9) Oder minde- 

stens die Aminosauresequenz von Aminosaure Nr. 1 bis 110 von Fig. 3 (SEQ ID No. 9) umfaBt. 

11. Gereinigtes BMP-9-Protein, erhaltlich durch die Schritte 

20 (a) Zuchtung einer Zelle, die mit einer cDN A transformiert ist, die die Nucleotidsequenz von Nucleotid Nr. 1 24 

bis 453 umfaBt, die in Fig. 3 (SEQ ID No. 8) gezeigt ist; und 

(b) Gewinnung und Reinigung eines Proteins, das die Aminosauresequenz von Aminosaure Nr. 1 bis 110 
umfaBt, die in Fig. 3 (SEQ ID No. 9) gezeigt ist, aus dem Kulturmedium. 

25 12. Gereinigtes BMP-9-Protein, erhaltlich durch die Schritte 

(a) Zuchtung einer Zelle, die mit einer cDNA transformiert ist, die die Nucleotidsequenz von Nucleotid Nr. 1 24 
bis 453 umfaBt, die in Fig. 3 (SEQ ID No. 8) gezeigt ist; und 

(b) Gewinnung eines Proteins, das die Aminosauresequenz von Aminosaure Nr. 8 bis 110 umfaBt, die in Fig. 
^o 3 (SEQ ID No. 9) gezeigt ist, aus dem Kulturmedium. 

13. Arzneimittel, das eine wirksame Menge eines Proteins nach einem der Anspruche 7 bis 12 gegebenenfalls in 
verbindung mit einem pharmazeutisch vertraglichen Trager. umfaBt. 

35 14. Arzneimittel nach Anspruch 13, das weiter eine Matrix als Trager des Arzneimittels umfaBt und eine Oberflache 
fur Knochen- und/oder Knorpelwachstum bereitstellt. 

15. Arzneimittel nach Anspruch 14, wobei die Matrix ein Material umfaBt, das Hydroxyapatit, Collagen, Polymilchsaure 
oder Tricalciumphosphat ist. 

1 6. Arzneimittel nach einem der Anspruche 1 3 bis 1 5 zur Wundheilung, Gewebewiederherstellung, Induktion des Kno- 
chenwachstums oder Induktion des Knorpelwachstums. 

17. Verwendung eines Proteins nach einem der Anspruche 7 bis 12 zur Herstellung eines Arzneimittels zur Induktion 
45 der Knochenbildung oder der Knorpelbildung, zur Behandlung von Wunden oder zur Gewebewiedertierstellung. 

18. Verfahren zur Herstellung einer DNA-Sequenz, die ein Protein mit der biologischen Aktivitat der Induktion der 
Bildung von Knorpel und/oder Knochen eines BMP-9-Proteins codiert, wobei die Sequenz ist 

50 (a) die DNA-Sequenz von Nucleotid 124 bis 453 von SEQ ID No. 8; Oder 

(b) die DNA-Sequenz von Nucleotid 145 bis 453 von SEQ ID No. 8; oder 

(c) eine DNA-Sequenz, die sich von der DNA-Sequenz nach (a) oder (b) aufgrund der Degeneration des 
genetischen Codes unterscheidet; oder 

(d) eine allelische Variante der Sequenz nach (a) oder (b); oder 

55 (e) eine DNA-Sequenz, die unter stringenten Bedingungen mit den Sequenzen nach (a) oder (b) hybridisiert, 

wobei das Verfahren die folgenden Schritte umfaBt: 
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(i) Plattierung einer menschlichen genomischen Genbank und Herstellung von Nitrocellulose-Zweifachrepli- 

kas; 

(ii) Hybridisierung eines Satzes der Nitrocellulose-Zweifachreplikas mit dem markierten Oligonucleotid 



: C I A I GAGTGTAAAGGGGGTTGCTTCTTCCCATTGGCTGAT 



und des anderen Satzes mit dem markierten Oligonucleotid 



n2: G i GCCAACCCTCAAGTACCACTATGAGGGGATGAG TGTGG; 

und 

(iii) Isolierung derjenigen Clone, die mit beiden Oligonucleotiden hybridisieren, und Bestimmung der Sequenz 
ihrer Insertionen. 

19. Verfahren zur Herstellung eines Mittels nach Anspruch 13, dadurch gekennzeichnet, daG man ein Protein nach 
einem der AnsprOche 7 bis 12 als wesentlichen Bestandteil des Mittels verwendet. 



Revendications 

1. Sequence d'ADN codant pour une proline ayant I'activite biologique d'une proteine BMP-9 d'induire la formation 
de cartilage et d'os, laquelle sequence est 

(a) la sequence d'ADN des nucleotides 124 k 453 de SEQ ID No. 8; ou 

(b) la sequence d'ADN des nucleotides 145 k 453 de SEQ ID No. 8; ou 

(c) une sequence d'ADN qui differe de la sequence d'ADN de (a) ou (b) due aux d6gen6rescences du code 
g£netique; 

(d) une variante allele de la sequence de (a) ou (b); ou 

(e) une sequence d'ADN s'hybridant sous des conditions rigoureuses en les sequences de (a) ou (b). 

2. Molecule d'ADN recombinant contenant une sequence d'ADN suivant la revendication 1 . 

3. Molecule d'ADN recombinant suivant la revendication 2, dans laquelle la sequence d'ADN est sous le controle 
d'eiements regulateurs permettant son expression dans une cellule hote d6siree. 

4. Cellule hote contenant la molecule d'ADN recombinant suivant Tune ou Pautre des revendications 2 et 3. 

5. Cellule hote suivant la revendication 4, qui est une cellule bacterienne, une cellule de levure ou upe cellule mam- 
mitere. 

6. Proced6 de production d'une proteine ayant I'activite biologique d'une proteine BMP-9, comprenant la culture d'une 
cellule hote suivant I'une ou I'autre des revendications 4 et 5 sous des conditions appropriees pour I'expression 
de la sequence d'ADN pr6cit6e et la recuperation de ladite proteine de ia culture. 

7. Proteine cod6e par la sequence d'ADN de la revendication 1 . 

8. Proteine produite par le procede de la revendication 6. 

9. Proteine ayant I'activite biologique d'une proteine BMP-9 comprenant une des sequences d'acides amines 
suivantes : 

(a) la sequence d'acides amines allant des acides amines n° 8 k 110 telle que representee k la figure 3 (SEQ 
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ID No. 9); ou 

(b) la sequence d'acides amines allant des acides amines n° 1 k 110 telle que representee k la figure 3 (SEQ 
ID No. 9). 

5 10. Proteine ayant I'activite biologique d'une proteine BPM-9, dans laquelle ladrte proline est un dimere dans lequel 
chaque sous-unite comprend au moins la sequence d'acides amines allant des acides amines n° 8 k 110 de la 
figure 3 (SEQ ID No. 9) ou au moins la sequence d'acides amines allant des acides amines n° 1 k 110 de la figure 
3 (SEQ ID No. 9). 

10 11. Proteine BMP-9 purifi6e obtenable par les Stapes suivantes : 

(a) la culture d'une cellule transforms avec un ADNc comprenant la sequence nucleotidique allant des nu- 
cleotides n° 124 k n° 453 telle que representee k la figure 3 (SEQ ID No. 8); et 

(b) la recuperation et la purification dudit milieu de culture d'une proteine comprenant la sequence d'acides 
*s amines allant des acides amines n° 1 k 110 telle que representee a la figure 3 (SEQ ID No. 9). 

12. Proteine BMP-9 purifi6e obtenable par les etapes suivantes : 

(a) la culture d'une cellule transformee avec un ADNc comprenant la sequence nucleotidique allant des nu- 
20 cieotides n° 124 k n° 453 telle que representee k la figure 3 (SEQ ID No. 8); et 

(b) la recuperation dudit milieu de culture d'une proteine comprenant une sequence d'acides amines allant 
des acides amines n° 8 k 110 telle que representee a la figure 3 (SEQ ID No. 9). 

13. Composition pharmaceutique comprenant une quantite efficace d'une proteine suivant I'une quelconque des re- 
25 vendications 7 & 12, eventuellement conjointement k un v6hicule pharmaceutiquement acceptable. 

14. Composition suivant la revendication 13, comprenant de plus une matrice pour supporter ladite composition et 
former une surface pour la croissance d'os et/ou de cartilage. 

30 15. Composition suivant la revendication 1 4, dans laquelle ladite matrice comprend une matiere qui est une hydroxya- 
patite, du collagene, de I'acide polylactique ou du phosphate tricalcique. 

16. Composition pharmaceutique suivant I'une quelconque des revendications 13 k 15, pour cicatriser les blessures, 
r6parer les tissus, induire une croissance osseuse ou induire la croissance de cartilage. 

35 

17. Utilisation d'une proteine suivant I'une quelconque des revendications 7 k 12 pour preparer une composition phar- 
maceutique pour induire une formation osseuse, la formation de cartilage, le traitement de blessures ou la repa- 
ration de tissus. 

■ 

40 18. Procede de preparation d'une sequence d'ADN codant pour une proteine ayant I'actrvite biologique d'une proteine 
BMP-9 d'induire la formation de cartilage et/ou d'os, laquelle sequence est 

(a) la sequence d'ADN des nucleotides 124 k 453 de SEQ ID No. 8; ou 

(b) la sequence d'ADN des nucleotides 145 k 453 de SEQ ID No. 8; ou 

4 $ (c) une sequence d'ADN qui differe de la sequence d'ADN de (a) ou (b) due aux degenerescences du code 

gen6tique; 

(d) une variante allele de la sequence de (a) ou (b); ou 

(e) une sequence d'ADN s'hybridant sous des conditions rigoureuses en les sequences de (a) ou (b), 
50 ledit proc6d6 comprenant les etapes suivantes : 

(i) I'etalement d'une bibliotheque g6nomique humaine et la preparation de repliques de nitrocellulose dupli- 
qu6es; 

(ii) I'hybridation d'une serie des repliques de nitrocellulose dupliqu6es avec ('oligonucleotide marque 
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n° 1 : CTATGAGTGTAAAGGGGGTTGCTTCTTCCCATTGGCTGAT 
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et I'autre s6rie avec ('oligonucleotide marque 

n° 2 : GTGCCAACCCTCAAGTACCACTATGAGGGGATGAGTGTGG; et 

(iii) I'isolement de ces clones qui s'hybrident aux deux oligonucleotides et la determination de la sequence de 
leurs inserts. 

19. Proc6d6 de fabrication d'une composition suivant la revendication 13, caracteris6e par I'utilisation de la prot£ine 
suivant Tune quelconque des revendications 7 & 12 comme constituant essentiel de ladite composition. 
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Figure 1A 

10 20 30 40 50 60 70 

CATTAATAAA TATTAAGTAT TGGAATTAGT GAAATTGGAG TTCCTTGTGG AAGGAAGTGG GCAAGTGAGC 

80 90 100 110 120 130 140 

TTTTTAGTTT GTGTCGGAAG CCTGTAATTA CGGCTCCAGC TCATAGTGGA ATGGCTATAC TTAGATTTAT 

150 160 170 180 190 200 210 

GGATAGTTGG GTAGTAGGTG TAAATGTATG TGGTAAAAGG CCTAGGAGAT TTGTTGATCC AATAAATATG 

220 230 240 250 260 270 280 

ATT AG GG AAA CAATTATTAG GGTTCATGTT CGTCCTTTTG GTGTGTGGAT TAGCATTATT TGTTTGATAA 

290 300 310 320 330 340 350 

TAAGTTTAAC TAGTCAGTGT TGGAAAGAAT GGAGACGGTT GTTGATTAGG CGTTTTGAGG ATGGGAATAG 

360 370 380 390 400 410 420 

GATTGAAGGA AATATAATGA TGGCTACAAC GATTGGGAAT CCTATTATTG TTGGGGTAAT GAATGAGGCA 

430 440 450 460 470 480 490 

AATAGATTTT CGTTCATTTT AATTCTCAAG GGGTTTTTAC TTTTATGTTT GTTAGTGATA TTGGTGAGTA 

500 510 520 530 540 550 560 

GGCCAAGGGT TAATAGTGTA ATTGAATTAT AGTGAAATCA TATTACTAGA CCTGATGTTA GAAGGAGGGC 
570 580 590 600 609 618 

> 

TGAAAAGGCT CCTTCCCTCC CAGGACAAAA CCGGAGCAGG GCCACCCGG ATG TCC CCT GGG 

M S P G 

627 636 645 654 663 672 



GCC TTC CGG GTG GCC CTG CTC CCG CTG TTC CTG CTG GTC TGT GTC ACA CAG CAG 
AFRV ALLPLFLLV CVTQQ 

681 690 699 708 717 726 



AAG CCG CTG CAG AAC TGG GAA CAA GCA TCC CCT GGG GAA AAT GCC CAC AGC TCC 
KPLQ N WEQASPGE NAHSS 

735 744 753 762 771 780 



CTG GGA TTG TCT GGA GCT GGA GAG GAG GGT GTC TTT GAC CTG CAG ATG TTC CTG 
LGLS GAGEEGVFDLQMFL 

789 798 807 816 825 834 

GAG AAC ATG AAG GTG GAT TTC CTA CGC AGC CTT AAC CTC AGC GGC ATT CCC TCC 
ENMKVDFLRSLNLSGIPS 
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Figure IB 

843 852 861 870 879 888 



CAG GAC AAA ACC AGA GCG GAG CCA CCC CAG TAC ATG ATC GAC TTG TAC AAC AGA 
QDKTRAEPPQYMI D L Y N R 

897 906 915 924 933 942 



TAC ACA ACG GAC AAA TCG TCT ACG CCT GCC TCC AAC ATC GTG CGG AGC TTC AGC 
YTTDKSSTPASNIVRSPS 

951 960 969 978 987 996 



GTG GAA GAT GCT ATA TCG ACA GCT GCC ACG GAG GAC TTC CCC TTT CAG AAG CAC 
VED A I STAATE.DF PFQKH 

1005 1014 1023 1032 1041 105O 

ATC CTG ATC TTC AAC ATC TCC ATC CCG AGG CAC GAG CAG ATC ACC AGG GCT GAG 
ILI FNISIPRHEQ ITRAE 

1059 1068 1077 1086 1095 . 1104 



CTC CGA CTC TAT GTC TCC TGC CAA AAT GAT GTG GAC TCC ACT CAT GGG CTG GAA 
LRLY VSCQNDVDSTHGLE 

1113 1122 1131 1140 1149 1158 



GGA AGC ATG GTC GTT TAT GAT GTT CTG GAG GAC AGT GAG ACT TGG GAC CAG GCC 
GSMVVYDVLE.DSETWDQA 

1167 1176 1185 1194 1203 1212 



ACG GGG ACC AAG ACC TTC TTG GTA TCC CAG GAC ATT CGG GAC GAA GGA TGG GAG 
TGTKTFLVSQDIRDEGWE 

1221 1230 1239 1248 1257 1266 

ACT TTA GAA GTA TCG AGT GCC GTG AAG CGG TGG GTC AGG GCA GAC TCC ACA ACA 
TLEVSSAVKRWVRADSTT 

1275 1284 1293 1302 1311 1320 

AAC AAA AAT AAG CTC GAG GTG ACA GTG CAG AGC CAC AGG GAG AGC TGT GAC ACA 
NKNKLEVTVQSHRESCDT 

1329 1338 1347 1356 1365 1374 

CTG GAC ATC AGT GTC CCT CCA GGT TCC AAA AAC CTG CCC TTC TTT GTT GTC TTC 
LDISVPPGSKNLPKFVVF 
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Figure 1C 

1383 1392 1401 1410 1419 1428 

AGC AAT GGG ACC AAG GAG ACC AGA CTG GAG CTG AAG GAG ATG 
ND RS NGTKETRLELKEM 

1437 1446 1455 1464 1473 1482 

ATC GGC CAT GAG CAG GAG ACC ATG CTT GTG AAG ACA GCC AAA AAT GCT TAC CAG 
1 GHEQ ETMLVKTAKNAY Q 

1491 1500 150 9 1518 1527 1536 

GTG GCA GGf GAG AGC CAA GAG GAG GAG GGT CTA GAT GGA TAC ACA GCT GTG GGA 
VAGES QEEEGLDG YTAVG 

1545 1554 1563 1572 1581 159Q 



CCA CTT TTA GCT AGA AGG AAG AGG AGC ACC GGA GCC AGC AGC CAC TGC CAG AAG 

P LLARRK RSTGA S S HCQ K 

(319) (326) 

1599 1608 1617 1626 1635 1644 



ACT TCT CTC AGG GTG AAC TTT GAG GAC ATC GGC TGG GAC AGC TGG ATC ATT GCA 
TSLRVNFED.IGWDSWII A 

1653 1662 1671 1680 1689 1698 



CCC AAG GAA TAT GAC GCC TAT GAG TGT AAA GGG GGT TGC TTC TTC CCA TTG GCT 

P KEY D AY E CKGGC F FP L A 

a 

1707 1716 1725 1734 1743 1752 



GAT GAC GTG ACA CCC ACC AAA CAT GCC ATC GTG CAG ACC CTG GTG CAT CTC GAG 
DDVTp TKHAIVQTLVHLE 

1761 1770 1779 1788 1797 1806 



TiC CCC ACA AAG GTG GGC AAA GCC TGC TGC GTT CCC ACC AAA CTG AGT CCC ATC 
FPT KV GKACCVPTKLSPI 

1815 1824 1833 1842 1851 1860 



TCC ATC CTC TAC AAG GAT GAC ATG GGG GTG CCA ACC CTC AAG TAC CAC TAT GAG 
S I L V K D D M GVPTL K Y H Y E 

1869 1378 1887 1903 1913 1923 



GGG ATG AGT GTG GCT GAG TGT GGG TGT AGG TAGTCCCTGC AGCCACCCAG GGTGGGGATA 
G M S V A E C G C R 

(428) 
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Figure ID 

1933 1943 1953 1963 1973 19B3 1993 

CAGGACATGG AAGAGGTTCT GGTACGGTCC TGCATCCTCC TGCGCATGGT ATGCCTAAGT TGATCAGAAA 

2003 2013 2023 2033 2043 2053 2063 

CCATCCTTGA GAAGAAAAGG AGTTAGTTGC CCTTCTTGTG TCTGGTGGGT CCCTCTGCTG AAGTGACAAT 

2073 2083 2093 2103 2113 2123 2133 

GACTGGGGTA TGCGGGCCTG TGGGCAGAGC AGGAGACCCT GGAAGGGTTA GTGGGTAGAA AGATGTCAAA 

2143 2153 2163 2173 2183 2193 2203 

AAGGAAGCTG TGGGTAGATG ACCTGCACTC CAGTGATTAG AAGTCCAGCC TTACCTGTGA GAGAGCTCCT 

2213 2223 2233 2243 2253 2263 2273 

GGCATCTAAG AGAACTCTGC TTCCTCATCA TCCCCACCGA CTTGTTCTTC CTTGGGAGTG TGTCCTCAGG 

2283 2293 2303 2313 2323 2333 2343 

GAGAACAGCA TTGCTGTTCC TGTGCCTCAA GCTCCCAGCT GACTCTCCTG TGGCTCATAG GACTGAATGG 

2353 2363 2373 2383 2393 2403 2413 

GGTGAGGAAG AGCCTGATGC CCTCTGGCAA TCAGAGCCCG AAGGACTTCA AAACATCTGG ACAACTCTCA 
2423 2433 2443 

TTGACTGATG CTCCAACATA ATTTTTAAAA AGAG 
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Figure 2 

10 20 30 40 50 60 70 

CTCTAGAGGG CAGAGGAGGA GGGAGGGAGG GAAGGAGCGC GGAGCCCGGC CCGGAAGCTA GGTGAGTGTG 

80 90 100 110 120 130 140 

GCATCCGAGC TGAGGGACGC GAGCCTGAGA CGCCGCTGCT GCTCCGGCTG AGTATCTAGC TTGTCTCCCC 

150 160 170 180 190 200 210 

GATGGGATTC CCGTCCAAGC TATCTCGAGC CTGCAGCGCC ACAGTCCCCG GCCCTCGCCC AGGTTCACTG 

220 230 240 250 260 270 280 

CAACCGTTCA GAGGTCCCCA GGAGCTGCTG CTGGCGAGCC CGCTACTGCA GGGACCTATG GAGCCATTCC 

290 300 310 320 330 340 350 

GTAGTGCCAT CCCGAGCAAC GCACTGCTGC AGCTTCCCTG AGCCTTTCCA GCAAGTTTGT TCAAGATTGG 

360 370 380 390 400 (1) 

CTGTCAAGAA TCATGGACTG TTATTATATG CCTTGTTTTC TGTCAAGACA CC ATG ATT CCT 

MET He Pro 

417 432 447 462 

GGT AAC CGA ATG CTG ATG GTC GTT TTA TTA TGC CAA GTC CTG CTA GGA GGC GCG 
Gly Asn Arg MET Leu MET Val Val Leu Leu Cys Gin Val Leu Leu Gly Gly Ala 

477 492 507 

AGC CAT GCT AGT TTG ATA CCT GAG ACG GGG AAG AAA AAA GTC GCC GAG ATT CAG 
Ser His Ala Ser Leu He Pro Glu Thr Gly Lys Lys Lys Val Ala Glu He Gin 

522 537 552 567 

GGC CAC GCG GGA GGA CGC CGC TCA GGG CAG AGC CAT GAG CTC CTG CGG GAC TTC 

Gly His Ala Gly Gly Arg Arg Ser Gly Gin Ser His Glu Leu Leu Arg Asp Phe 

582 597 612 627 

GAG GCG ACA CTT CTG CAG ATG TTT GGG CTG CGC CGC CGC CCG CAG CCT AGC AAG 
Glu Ala Thr Leu Leu Gin MET Phe Gly Leu Arg Arg Arg Pro Gin Pro Ser Lys 

642 657 672 

AGT GCC GTC ATT CCG GAC TAC ATG CGG GAT CTT TAC CGG CTT CAG TCT GGG GAG 
Ser Ala Val He Pro Asp Tyr MET Arg Asp Leu Tyr Arg Leu Gin Ser Gly Glu 

687 702 717 732 

GAG GAG GAA GAG CAG ATC CAC AGC ACT GGT CTT GAG TAT CCT GAG CGC CCG GCC 
Glu Glu Glu Glu Gin He His Ser Thr Gly Leu Glu Tyr Pro Glu Arg Pro Ala 

747 762 777 

AGC CGG GCC AAC ACC GTG AGG AGC TTC CAC CAC GAA GAA CAT CTG GAG AAC ATC 

Ser Arg Ala Asn Thr Val Arg Ser Phe His His Glu Glu His Leu Glu Asn He 

792 807 822 837 

CCA GGG ACC AGT GAA AAC TCT GCT TTT CGT TTC CTC TTT AAC CTC AGC AGC ATC 

Pro Gly Thr Ser Glu Asn Ser Ala Phe Arg Phe Leu Phe Asn Leu Ser Ser He 
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Figure 2A 

852 867 882 897 

CCT GAG AAC GAG GTG ATC TCC TCT GCA GAG CTT CGG CTC TTC CGG GAG CAG GTG 
Pro Glu Asn Glu Val lie Ser Ser Ala Glu Leu Arg Leu Phe Arg Glu Gin Val 

912 927 942 

GAC CAG GGC CCT GAT TGG GAA AGG GGC TTC CAC CGT ATA AAC ATT TAT GAG GTT 
Asp Gin Gly Pro Asp Trp Glu Arg Gly Phe His Arg He Asn He Tyr Glu Val 

957 972 987 1002 

ATG AAG CCC CCA GCA GAA GTG GTG CCT GGG CAC CTC ATC ACA CGA CTA CTG GAC 
MET Lys Pro Pro Ala Glu Val Val Pro Gly His Leu lie Thr Arg Leu Leu Asp 

1017 1032 1047 

ACG AGA CTG GTC CAC CAC AAT GTG ACA CGG TGG GAA ACT TTT GAT GTG AGC CCT 
Thr Arg Leu Val His His Asn Val Thr Arg Trp Glu Thr Phe Asp Val Ser Pro 

1062 1077 1092 1107 

GCG GTC CTT CGC TGG ACC CGG GAG AAG CAG CCA AAC TAT GGG CTA GCC ATT GAG 

Ala Val Leu Arg Trp Thr Arg Glu Lys Gin Pro Asn Tyr Gly Leu Ala He Glu 

1122 1137 1152 1167 

GTG ACT CAC CTC CAT CAG ACT CGG ACC CAC CAG GGC CAG CAT GTC AGG ATT AGC 
Val Thr His Leu His Gin Thr Arg Thr His Gin Gly Gin His Val Arg He Ser 

1182 1197 1212 

CGA TCG TTA CCT CAA GGG AGT GGG AAT TGG GCC CAG CTC CGG CCC CTC CTG GTC 
Arg Ser Leu Pro Gin Gly Ser Gly Asn Trp Ala Gin Leu Arg Pro Leu Leu Val 

1227 1242 1257 1272 

ACC TTT GGC CAT GAT GGC CGG GGC CAT GCC TTG ACC CGA CGC CGG AGG GCC AAG 

Thr Phe Gly His Asp Gly Arg Gly His Ala Leu Thr Arg Arg Arg Arg Ala Lys 

1287 1302 1317 

CGT AGC CCT AAG CAT CAC TCA CAG CGG GCC AGG AAG AAG AAT AAG AAC TGC CGG 
Arg Ser Pro Lys His His Ser Gin Arg Ala Arg Lys Lys Asn Lys Asn Cys Arr 

1332(311) 1347 1362 1377 

CGC CAC TCG CTC TAT GTG GAC TTC AGC GAT GTG GGC TGG AAT GAC TGG ATT GTG 

Arg His Ser Leu Tyr Val Asp Phe Ser Asp Val Gly Trp Asn Asp Trp He Val 

1392 1407 1422 143T 

GCC CCA CCA GGC TAC CAG GCC TTC TAC TGC CAT GGG GAC TGC CCC TTT CCA CT2 
Ala Pro Pro Gly Tyr Gin Ala Phe Tyr Cys His Gly Asp Cys Pro Phe Pro Leu 

1452 1467 1482 

GCT GAC CAC CTC AAC TCA ACC AAC CAT GCC ATT GTG CAG ACC CTG GTC AAT TCT 
Ala Asp His Leu Asn Ser Thr Asn His Ala He Val Gin Thr Leu Val Asn Ser 

1497 1512 1527 1542 

GTC AAT TCC AGT ATC CCC AAA GCC TGT TGT GTG CCC ACT GAA CTG AGT GCC AT Z 
Val Asn Ser Ser He Pro Lys Ala Cys Cys Val Pro Thr Glu Leu Ser Ala He 
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Figure 2B 

1557 1572 1587 

TCC ATG CTG TAG CTG GAT GAG TAT GAT AAG GTG GTA CTG AAA AAT TAT CAG GAG 

Ser MET Leu Tyr Leu Asp Glu Tyx Asp Lys Val Val Leu Lys Asn Tvr Gin G}u 

1602 1617 (408) 1636 1646 1656 

ATG GTA GTA GAG GGA TGT GGG TGC CGC TGAGATCAGG CAGTCCTTGA GGATAGACAG 

WET Val, Val glM <?lV Cys Gly cys Arg 

1666 1676 1686 1696 1706 1716 1726 

ATATACACAC CACACACACA CACCACATAC ACCACACACA CACGTTCCCA TCCACTCACC CACACACTAC 

1736 1746 1756 1766 1776 1786 1796 

ACAGACTGCT TCCTTATAGC TGGACTTTTA TTTAAAAAAA AAAAAAAAAA AATGGAAAAA ATCCCTAAAC 

1806~ 1816 1826 1836 1846" 1856 1866 

ATTCACCTTG ACCTTATTTA TGACTTTACG TGCAAATGTT TTGACCATAT TGATCATATA TTTTGACAAA 

1876 1886 1896 1906 1916 1926 1936 

ATATATTTAT AACTACGTAT TAAAAGAAAA AAATAAAATG AGTCATTATT TTAAAAAAAA AAAAAAAACT 

1946 

CTAGAGTCGA CGGAATTC 
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Figure 3 



TGA ACA AGA GAG TGC TCA AGA AGC TGT CCA AGG ACG GCT CCA CAG AGG 

* Thr Arg Glu Cys Ser Arg Ser Cys Pro Arg Thr Ala Pro Gin Arg 
-41 -40 -35 -30 

CAG GTG AGA GCA GTC ACG AGG AGG ACA CGG ATG GCG CAC GTG GCT GCG 
Gin Val Arg Ala Val Thr Arg Arg Thr Arg Met Ala His Val Ala Ala 
-25 -20 -15 -10 

GGG TCG ACT TTA GCC AGG CGG AAA AGG AGC GCC GGG GCT GGC AGC CAC 
Gly Ser Thr Leu Ala Arg Arg Lye Arg Ser Ala Gly Ala Gly Ser His 

-5 15 

TGT CAA AAG ACC TCC CTG CGG GTA AAC TTC GAG GAC ATC GGC TGG GAC 

cys Gin Lye Thr Ser Leu Arg Val Asn Phe Glu Asp lie Gly Trp Asp 
10 15 20 

AGC TGG ATC ATT GCA CCC AAG GAG TAT GAA GCC TAC GAG TGT AAG GGC 
Ser Trp lie lie Ala Pro Lys Glu Tyr Glu Ala Tyr Glu Cys Lys Gly 
25 30 35 

GGC TGC TTC TTC CCC TTG GCT GAC GAT GTG ACG CCG ACG AAA CAC GCT 
Gly Cys Phe Phe Pro Leu Ala Asp Asp Val Thr Pro Thr Lys His Ala 
40 45 50 55 

ATC GTG CAG ACC CTG GTG CAT CTC AAG TTC CCC ACA AAG GTG GGC AAG 

lie Val Gin Thr Leu Val His Leu Lys Phe Pro Thr Lys Val Gly Lys 

60 65 70 

GCC TGC TGT GTG CCC ACC AAA CTG AGC CCC ATC TCC GTC CTC TAC AAG 
Ala Cys Cys Val Pro Thr Ly6 Leu Ser Pro lie Ser Val Leu Tyr Lys 

75 B0 85 

GAT GAC ATG GGG GTG CCC ACC CTC AAG TAC CAT TAC GAG GGC ATG AGC 

Asp Asp Met Gly Val Pro Thr Leu Lys Tyr His Tyr Glu Gly Met Ser 
90 95 100 

GTG GCA GAG TGT GGG TGC AGG TAGTATCTGC CTGCGGG 
Val Ala Glu Cys Gly Cys Arg 
105 110 
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